10/511436 

DT05 Rec'd PCT/PTO £5 OCT 2004 



WO 03/091431




PCT/JP03/05464 



SEQUENCE LISTING 
chain

<130> PH-1796-PCT 

<150> JP 2002-127677 
<160> 120

<170> PatentIn Ver. 2.0

<210> 1 
<211> 11 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 1 

Ala Tyr Met Phe Lys Tyr Asp Ser Thr His Gly 
1 5 10

<210> 2 
<211> 11 
<212> PRT 
<213> Saccharomyces cerevisiae



<400> 2 



Asp Gly Pro Ser His Lys Asp Trp Arg Gly Gly
1 5 10



<210> 3
<211> 32
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer PGP5 for amplification
5'-region of Ogataea minuta GAP gene

<400> 3

gcntayatgt tyaartayga ywsnacncay gg 32

<210> 4
<211> 32
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer PGP3 for amplification
3'-region of Ogataea minuta GAP gene

<400> 4

ccnccnckcc artcyttrtg nswnggnccr tc 32

<210> 5
<211> 3186 
<212> DNA 

<213> Ogataea minuta 

<220> 

<221> CDS 

<222> 1492..2502

<400> 5 

aagctttact ggttcaaggg gttaagtagg ggcgcggtct ggtctttgtg gttgtttcta 60

cacggaccac agttgacagc atcgactgct catcgaaaac ggtcgcagtg cggcaatctg 120

ctctatctaa tcccaggcta ctcgatccct gcacaaccta cagagtgatc cgaccgcact 180

gcccgagatt cagcagactc tcgcagcgca gcgtgcgttt taatccctca aatcaaggct 240 

gtgcagaccc ggaggatgtg aagctgggac ggcgggaggg aagtctggag tggtgagaga 300 

atgtgggagc tgtgcaaagg ggcaatggtc actcagcgca gagcgatggt ggcgcggggg 360

ccaatatctc ggcaacaaga acgcccgagg acgacgggac tctgaatgcg agcacgttgt 420 

ctttcagaca gtccacccgg attccaatat tcgcaggact cgcgctcaga aacgcaaccc 480 

cggcagattc gcgtccagtc aggccatctg cggcgagctg ctgcgctcgc gggctgcgcc 540 

acaacgcatc gccacatata cgtcaccgcc cgcccgctgg caacctgagg tttttccgca 600 

acgggtgcac tgattgctgc gttaacgagg caactggaga tgtcagaggc caagtggagc 660

catatcacag cggactgcgc atctctggcc tgccggacgc ggtagcgtcc cgtctttttg 720 

cggacagctt cttaaaacct ggctgaaact aagcgagacc tgcgacctgg aacgcccgca 780 

cacccgtaca cctccggagt tgtatcctca gaagcggagt aacctgcagg cctacgcaag 840 

aaaagagccc gggacccatc gaccggaaaa gaggggtgga gctagtgggg tagccttgga 900 

gcagacctgg ggcagacctg ggttagtacc agggccgaaa agggtcagag gaatcagggt 960

ggcacggcag tctataccgt agaagctctt ctcgacagca gcgagcagaa actgcacaga 1020 

ggtccgttcg ccagtctcgt accaccaccg catgacccaa tcagcattga tgctcccaca 1080 
tgggtagtgc gcgcgaacgc ctggcaccca aacacaccac ttacgcttcc cgcaccgcgg 1140

tggttaacac tggcccggag tagtcatata cggagatttt ggcatgattc taattccggg 1200

tcgggacacg acctaagtgg cgtgcaaagc tcgggggcta aatgtttccc ggcgctcgcg 1260

gcgactcttg tgcgcgcccg cggcggttcg cgggagacgg gggaaagaga ggggtgaccg 1320

cagcgagcga tggtgtgcca gatctcaggc cgagtcaaga caatatataa agagaggatt 1380

gtccactttt ctccaatagt atttgacccg ggttgctctc tgttgatttt ttctagatca 1440

tacaattatt gtttgaattc actcaattaa catacacaaa tacaatacaa aatggcttac 1500

aacgtcggta tcaacggatt cggaagaatt ggtagactcg ttcttagaat tgctttgtcc 1560

agaaaggaca tcaacgtggt tgccgtgaat gatccattca tcgctgccga gtacgctgct 1620

tacatgttca agtacgactc cactcacgga agataccaag gtgaagtcac cttcgaggga 1680

aagtaccttg tgatcgacgg tcagaagatt gaggtgttcc aagagagaga ccctgctgac 1740

atcccatggg gtaaggaggg cgttgacttt gtcattgact ccaccggtgt gttcaccacc 1800

accgccggcg ctcaaaagca cattgatgct ggtgccaaga aggttatcat cactgctcca 1860

tccgctgacg ctccaatgtt cgttatgggt gtcaaccaca aggagtacac caaggacttg 1920

tccattgtct ccaacgcttc ctgtaccacc aactgtctgg ctccattggc caaggttgtt 1980

aacgacgttt tcggtattga gtctggtttg atgaccaccg tccactctat cactgccacc 2040

caaaagaccg ttgacggtcc atcccacaag gactggagag gaggaagaac cgcttccggt 2100

aacatcattc catcctccac cggtgccgct aaggctgtcg gtaaggtctt gccagctctt 2160

gctggtaagt tgactggtat gtctctgaga gttcctacca ccgatgtttc cgttgttgac 2220

ttgactgtca acttgaagac cccaaccacc tacgcagaga tctccgccgc catcaagaag 2280

gcctctgagg gtgaacttgc cggtatcttg ggttacactg aggacgccgt tgtctccact 2340

gacttcttga ccgacaacag atcttcgatc tttgacgcct ctgccggtat cttgttgacc 2400

ccaactttcg tcaagttgat ctcctggtac gataacgagt acggttactc caccagagtt 2460

gtcgacttgc ttgagcacgt tgccaaggtc tcttccgctt aagtggatag atgaccaatg 2520

gcctctttaa gtaaacattt cgttttgaat atatttcaag ttgaataatg aaagccttgt 2580

tgtagactta ctccgaagct ccggggcttc ggctccctga atttattttt tacatctctg 2640

caccggaaaa ctggctattt gaaaaatttc gacgttttgc ttgaaactcg agttgaggag 2700

cattgccaaa ttcgatcgtt ttctaacgga cgccagtcga gttattgtta tgtcacgtga 2760

catcaattgt cctctattcc tttttggccg atctcgtttg tgctgacggc ctccgaacag 2820

ttacttctac cggcagggat tggggatgat cgggatcgat gtcctcaact ccagaggctg 2880

atccgatgcg gtgggacttc atgcgtccaa atctgttgga tgatgtgctc ttctgctttt 2940 

ttggtgacca aacgagatga caattgactg cattgaaaag gttattagct tttttggtct 3000 

tctcctgtgt cgattcgagc ggtaccgtag gtaggtctgc tatggaggca tgcgtcataa 3060 

gtcagccttg attaactttc ggagctgcgc gatccacatc tctgcaccgc gcggaggcct 3120 

ttgactgcag cattttaatt aatctcgtaa aataagctct taaacgagat tagcttacgg 3180 

ggatcc 3186 



<210> 6 
<211> 336 
<212> PRT 

<213> Ogataea minuta 



<400> 6 

Met Ala Tyr Asn Val Gly Ile Asn Gly Phe Gly Arg Ile Gly Arg Leu
1 5 10 15

Val Leu Arg Ile Ala Leu Ser Arg Lys Asp Ile Asn Val Val Ala Val
20 25 30

Asn Asp Pro Phe Ile Ala Ala Glu Tyr Ala Ala Tyr Met Phe Lys Tyr
35 40 45

Asp Ser Thr His Gly Arg Tyr Gln Gly Glu Val Thr Phe Glu Gly Lys
50 55 60

Tyr Leu Val Ile Asp Gly Gln Lys Ile Glu Val Phe Gln Glu Arg Asp
65 70 75 80

Pro Ala Asp Ile Pro Trp Gly Lys Glu Gly Val Asp Phe Val Ile Asp
85 90 95

Ser Thr Gly Val Phe Thr Thr Thr Ala Gly Ala Gln Lys His Ile Asp
100 105 110

Ala Gly Ala Lys Lys Val Ile Ile Thr Ala Pro Ser Ala Asp Ala Pro
115 120 125

Met Phe Val Met Gly Val Asn His Lys Glu Tyr Thr Lys Asp Leu Ser
130 135 140

Ile Val Ser Asn Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Leu Ala
145 150 155 160

Lys Val Val Asn Asp Val Phe Gly Ile Glu Ser Gly Leu Met Thr Thr
165 170 175

Val His Ser Ile Thr Ala Thr Gln Lys Thr Val Asp Gly Pro Ser His
180 185 190

Lys Asp Trp Arg Gly Gly Arg Thr Ala Ser Gly Asn Ile Ile Pro Ser
195 200 205

Ser Thr Gly Ala Ala Lys Ala Val Gly Lys Val Leu Pro Ala Leu Ala
210 215 220

Gly Lys Leu Thr Gly Met Ser Leu Arg Val Pro Thr Thr Asp Val Ser
225 230 235 240

Val Val Asp Leu Thr Val Asn Leu Lys Thr Pro Thr Thr Tyr Ala Glu
245 250 255

Ile Ser Ala Ala Ile Lys Lys Ala Ser Glu Gly Glu Leu Ala Gly Ile
260 265 270

Leu Gly Tyr Thr Glu Asp Ala Val Val Ser Thr Asp Phe Leu Thr Asp
275 280 285

Asn Arg Ser Ser Ile Phe Asp Ala Ser Ala Gly Ile Leu Leu Thr Pro
290 295 300

Thr Phe Val Lys Leu Ile Ser Trp Tyr Asp Asn Glu Tyr Gly Tyr Ser
305 310 315 320

Thr Arg Val Val Asp Leu Leu Glu His Val Ala Lys Val Ser Ser Ala
325 330 335

<210> 7 
<211> 1491 
<212> DNA 

<213> Ogataea minuta 
<400> 7 

aagctttact ggttcaaggg gttaagtagg ggcgcggtct ggtctttgtg gttgtttcta 60 
cacggaccac agttgacagc atcgactgct catcgaaaac ggtcgcagtg cggcaatctg 120 
ctctatctaa tcccaggcta ctcgatccct gcacaaccta cagagtgatc cgaccgcact 180 
gcccgagatt cagcagactc tcgcagcgca gcgtgcgttt taatccctca aatcaaggct 240 
gtgcagaccc ggaggatgtg aagctgggac ggcgggaggg aagtctggag tggtgagaga 300
atgtgggagc tgtgcaaagg ggcaatggtc actcagcgca gagcgatggt ggcgcggggg 360 
ccaatatctc ggcaacaaga acgcccgagg acgacgggac tctgaatgcg agcacgttgt 420 
ctttcagaca gtccacccgg attccaatat tcgcaggact cgcgctcaga aacgcaaccc 480 
cggcagattc gcgtccagtc aggccatctg cggcgagctg ctgcgctcgc gggctgcgcc 540 
acaacgcatc gccacatata cgtcaccgcc cgcccgctgg caacctgagg tttttccgca 600 
acgggtgcac tgattgctgc gttaacgagg caactggaga tgtcagaggc caagtggagc 660 
catatcacag cggactgcgc atctctggcc tgccggacgc ggtagcgtcc cgtctttttg 720 
cggacagctt cttaaaacct ggctgaaact aagcgagacc tgcgacctgg aacgcccgca 780 
cacccgtaca cctccggagt tgtatcctca gaagcggagt aacctgcagg cctacgcaag 840
aaaagagccc gggacccatc gaccggaaaa gaggggtgga gctagtgggg tagccttgga 900
gcagacctgg ggcagacctg ggttagtacc agggccgaaa agggtcagag gaatcagggt 960 
ggcacggcag tctataccgt agaagctctt ctcgacagca gcgagcagaa actgcacaga 1020 
ggtccgttcg ccagtctcgt accaccaccg catgacccaa tcagcattga tgctcccaca 1080 
tgggtagtgc gcgcgaacgc ctggcaccca aacacaccac ttacgcttcc cgcaccgcgg 1140 
tggttaacac tggcccggag tagtcatata cggagatttt ggcatgattc taattccggg 1200 
tcgggacacg acctaagtgg cgtgcaaagc tcgggggcta aatgtttccc ggcgctcgcg 1260 
gcgactcttg tgcgcgcccg cggcggttcg cgggagacgg gggaaagaga ggggtgaccg 1320 
cagcgagcga tggtgtgcca gatctcaggc cgagtcaaga caatatataa agagaggatt 1380 
gtccactttt ctccaatagt atttgacccg ggttgctctc tgttgatttt ttctagatca 1440 
tacaattatt gtttgaattc actcaattaa catacacaaa tacaatacaa a 1491 

<210> 8 
<211> 524 
<212> DNA 

<213> Ogataea minuta 
<400> 8 

gtggatagat gaccaatggc ctctttaagt aaacatttcg ttttgaatat atttcaagtt 60 
gaataatgaa agccttgttg tagacttact ccgaagctcc ggggcttcgg ctccctgaat 120
ttatttttta catctctgca ccggaaaact ggctatttga aaaatttcga cgttttgctt 180
gaaactcgag ttgaggagca ttgccaaatt cgatcgtttt ctaacggacg ccagtcgagt 240
tattgttatg tcacgtgaca tcaattgtcc tctattcctt tttggccgat ctcgtttgtg 300
ctgacggcct ccgaacagtt acttctaccg gcagggattg gggatgatcg ggatcgatgt 360
cctcaactcc agaggctgat ccgatgcggt gggacttcat gcgtccaaat ctgttggatg 420
atgtgctctt ctgctttttt ggtgaccaaa cgagatgaca attgactgca ttgaaaaggt 480
tattagcttt tttggtcttc tcctgtgtcg attcgagcgg tacc 524


<210> 9
<211> 113
<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: primer for production of an
expression cassette with GAP gene promoter and terminator from Ogataea minuta
<400> 9 

gtttgaattc actcaattaa catacacaaa tacaatacaa agtcgacaaa aaatgcatgt 60 
ggatagatga ccaatggcct ctttaagtaa acatttcgtt ttgaatatat ttc 113 

<210> 10 
<211> 38 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer for production of an
expression cassette with GAP gene promoter and terminator from Ogataea minuta
<400> 10 

tttttactag tacggtaccg ctcgaatcga cacaggag 38 

<210> 11

<211> 12 

<212> PRT 

<213> Saccharomyces cerevisiae 

<400> 11 

Gly Pro Tyr Ile Cys Leu Val Lys Thr His Ile Asp
1 5 10 

<210> 12 
<211> 11 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 12 

Gly Arg Gly Leu Phe Gly Lys Gly Arg Asp Pro 
1 5 10

<210> 13 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> Description of Artificial Sequence: primer PUR5 for amplification of
5'-region of Ogataea minuta URA3 gene

<400> 13 

ggnccntaya thtgyytngt naaracncay athga 35 

<210> 14 
<211> 32

<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: primer PUR3 for amplification of
3'-region of Ogataea minuta URA3 gene

<400> 14 

ggrtcncknc cyttnccraa narnccnckn cc 32 

<210> 15

<211> 3113 
<212> DNA 

<213> Ogataea minuta 

<220> 

<221> CDS 

<222> 1732..2529

<400> 15 

gccgcggccg ctgctgctgc ttccactaaa acagcaacga gcaacgcgtc tgccgaaaac 60 
tccttgaatc aagacctgga caatcatttg acgttgggca gagagcattt cgacaccact 120
gaggttccta ccgcggacgg gtccaaagtg gaggttctcc gaaacatgtc tgtcgagacg 180
ggtcctgccg acgatcttaa caaaaacccc tccaccagcg agctggtgca tctggaggaa 240 
aaatcacagg aaagcgcatc cgaggaagag gtcaggacct cgaaccatgc cgacacagcc 300 
ggaacagaac caggtccaga acacgtccat ggcaacgata aagcggaggg cgagggcgag 360
tcctcagaag atgaccagga aatggtggac gctccactgc ctccttcgga cgataaggag 420 
actgagaacg cgctgccgac ggagactaaa gtggagtcga ccaaagacga tgtagaccag 480 
gaagaagagg aagaagagga ggaagaggaa gaaacagtac ctttccaagt ctctaaaaag 540 
gtatccaagg aggaagaaat ttcagctccg acgccggagc ccactacgcc tacgtcggcg 600 
aacgagagcg aggaggaagg cgataccagg ccccggaaaa ggcggcggtc ggagtcgatt 660
tcggccgcct ccagcaagag atttttggct cttggtactc aactgttgag ccaagtttcg 720 
tcgaatcggt ttgcgtcgat gtttttgcag ccagtgaaca aaaacgagga gcctgagtat 780
tacaagctca tccaccagcc gatcgatctc aagacgctgt cgaagtcggt ccgaaccggc 840
gagattcagt cgttcgatga ccttgagttc cagctgcaac tcatgttcag caatgcaatc 900
atgtacaacg acacctacca gacggaaacg tacaaatgga cgatcgagat gatggaggaa 960 
gcccagaatc tgattgaaat gttcagggaa acttccaaca actgagatca actgcgacta 1020 
cttctgttgg ctggctggac gggttgtatt actatcttgg acaacgctat gtaaccttat 1080 
ctaaatacaa gaattcatgt acaaaatcat ttgtgcgggc gcagagacga gcgacgagtt 1140 
gccgaaatca cccggctgct cagttaccac ctctcatttg gttcatgagc atttgattct 1200 
gctcctggaa tctagatccg actctctcac tgtgcttgag gaacttctca gcacacttgt 1260 
tcaaacaggt ctcctctctg gagctgagct tgttggaggt gaagtcattg acacagtcgt 1320 
tgaaacatct gtcgacaaga ttggtgtaca actggggcaa aataatgtta gtcgtggttc 1380
atcaaaggct cgacgtcatt ttgctgtctc tagtaactta ccctcatgaa gtcgttcatc 1440 
tgcttctgct cgacgatttt ctggaattcc tgttgttctt tgtagttgag ttgatccatt 1500 
ttgctgtttt tctagttctg ctttgctaga ctgttggcca atatctggtt atccctctag 1560 
cttatcgtgg agaagggtgt ttttttgcta ccaaaagctg aaaattctga aaaattttcg 1620
gatttgaatt tttttttacc cggcactttt tgaccccata ctagttgtac caaactgaaa 1680 
gagactgcag ttggtctttg cggggagatt ttggcagata aacaggcgac tatgtcctcg 1740 
actaagacat acgcgcaaag ggcggcggct catccgtcga ctgtggccag aagactgctg 1800 
aacttgatgg aatccaagaa gacgaacttg tgtgcctcgg tcgatctcac ctctacaaag 1860
gaccttttgg agctgttgga caagctggga ccgttcattt gtctggtcaa gacacacatc 1920 
gacattgtgg aagacttttc gtacgaaaac accgtggtgc cgctgctgaa actggccaag 1980 
aaacacaact tcatgatctt cgaggaccga aaatttgccg atataggcaa caccgtcaaa 2040 
ctccagtaca agggaggagt ttaccaaatc gcaaagtggg ccgatatcac caacgcccac 2100 
ggagtgaccg gctcgcgaat tgtctcgggt ctcagacagg ctgcccagga gaccaccgac 2160 
gagccaagag gtctgctcat gctggctgag ctgtcgtctg aaggctcgct cgcgtacgga 2220 
gagtacacca aaaagacggt tgaaatcgca aagtccgaca gagattttgt gatcggtttc 2280 
attgcgcaaa acgacatggg tggccgcgat gagggcttcg actggctcat catgacccca 2340 
ggtgtcggac tcgacgacac cggtgacgct ctgggccagc agtaccgcac ggtcagcgcc 2400
gttatgaaga cgggaactga catcataatc gtgggcaggg gactgttcgg caagggaaga 2460
gaccctgtcg tggaaggcga aagatacaga aaggctggat gggacgctta tttgagtcgt 2520 
gtcgcatgat ttcgggtcac gtgactatat agctattggt atgtacaaga attaattagc 2580 
ggagtttgtc gccaaactct tcggccaact cgatgctcag tttctggcgt gaaatttcga 2640 
acaccagcag cccgatggag gtagccggta gacttgttgt tgcagttctc gcgaatcccc 2700 
tgtagaagaa gcccagtagg gagagatggg acttgcggta tctggtcatc atgatttcga 2760 
aagtttcgag gtatgaattg tagtagagct taaagaaacg gcttctctct agatggtggg 2820 
cctcgttgta cagatcaagc gactccagtc tggacagatg gaccttctgg attttgttga 2880 
acggaaattg gattgccagc agggttgtgg cggcactggc tccagccaaa agaatgaagg 2940 
tcagccggag agctttgatc gatttcgagt gtttgtccag gtccgggttc ttctctccgt 3000 
ataacagacg ggctttccag tactggtacc agtttatcat gctctgagtt ctgtggaagc 3060 
cctggttttt cacaaactca aacacggaga agtagaacgc aaacccaaag ctt 3113 

<210> 16 
<211> 265 
<212> PRT 

<213> Ogataea minuta 



<400> 16 
Met Ser Ser Thr Lys Thr Tyr Ala Gln Arg Ala Ala Ala His Pro Ser
1 5 10 15

Pro Val Ala Arg Arg Leu Leu Asn Leu Met Glu Ser Lys Lys Thr Asn
20 25 30

Leu Cys Ala Ser Val Asp Leu Thr Ser Thr Lys Asp Leu Leu Glu Leu
35 40 45

Leu Asp Lys Leu Gly Pro Phe Ile Cys Leu Val Lys Thr His Ile Asp
50 55 60

Ile Val Glu Asp Phe Ser Tyr Glu Asn Thr Val Val Pro Leu Leu Lys
65 70 75 80

Leu Ala Lys Lys His Asn Phe Met Ile Phe Glu Asp Arg Lys Phe Ala
85 90 95

Asp Ile Gly Asn Thr Val Lys Leu Gln Tyr Lys Gly Gly Val Tyr Gln
100 105 110

Ile Ala Lys Trp Ala Asp Ile Thr Asn Ala His Gly Val Thr Gly Ser
115 120 125

Arg Ile Val Ser Gly Leu Arg Gln Ala Ala Gln Glu Thr Thr Asp Glu
130 135 140

Pro Arg Gly Leu Leu Met Leu Ala Glu Leu Ser Ser Glu Gly Ser Leu
145 150 155 160

Ala Tyr Gly Glu Tyr Thr Lys Lys Thr Val Glu Ile Ala Lys Ser Asp
165 170 175

Arg Asp Phe Val Ile Gly Phe Ile Ala Gln Asn Asp Met Gly Gly Arg
180 185 190

Asp Glu Gly Phe Asp Trp Leu Ile Met Thr Pro Gly Val Gly Leu Asp
195 200 205

Asp Thr Gly Asp Ala Leu Gly Gln Gln Tyr Arg Thr Val Ser Ala Val
210 215 220

Met Lys Thr Gly Thr Asp Ile Ile Ile Val Gly Arg Gly Leu Phe Gly
225 230 235 240

Lys Gly Arg Asp Pro Val Val Glu Gly Glu Arg Tyr Arg Lys Ala Gly
245 250 255

Trp Asp Ala Tyr Leu Ser Arg Val Ala
260 265

<210> 17 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer for amplification of a gene 
fragment conferring resistance against chloramphenicol

<400> 17

atggagaaaa aaactagtgg atataccacc 30



<210> 18
<211> 30
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer for amplification of a gene
fragment conferring resistance against chloramphenicol

<400> 18

ctgagacgaa aaagatatct caataaaccc 30

<210> 19
<211> 28
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer DU5 used for confirmation
of destruction of Ogataea minuta URA3 gene

<400> 19

aggaagaaga ggaggaagag gaagaaac 28

<210> 20
<211> 28
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer DUC5 used for confirmation
of destruction of Ogataea minuta URA3 gene

<400> 20

cgatgccatt gggatatatc aacggtgg 28

<210> 21
<211> 29
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer DU3 used for confirmation
of destruction of Ogataea minuta URA3 gene

<400> 21

ccgtgtttga gtttgtgaaa aaccagggc 29



<210> 22
<211> 28
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer DUC3 used for confirmation
of destruction of Ogataea minuta URA3 gene

<400> 22

tgtggcgtgt tacggtgaaa acctggcc 28

<210> 23
<211> 14
<212> PRT

<213> Saccharomyces cerevisiae
<400> 23

Phe Val Ala Thr Asp Arg Ile Ser Ala Tyr Asp Val Ile Met
1 5 10

<210> 24
<211> 14
<212> PRT

<213> Saccharomyces cerevisiae
<400> 24

Gln Asp Ser Tyr Asp Lys Gln Phe Leu Arg Asp Trp Leu Thr
1 5 10

<210> 25
<211> 42
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer PAD5 for amplification of
5'-region of Ogataea minuta ADE1 gene

<400> 25

ttygtngcna cngaymgnat hwsngcntay gaygtnatha tg 42

<210> 26 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer PAD3 for amplification of 
3' -region of Ogataea minuta ADE1 gene 

<400> 26 

gtnarccart cncknarraa ytgyttrtcr tanswrtcyt g 41 

<210> 27 
<211> 2560 
<212> DNA 

<213> Ogataea minuta 

<220> 
<221> CDS 
<222> 939..1850
<400> 27

gatatcccaa gaacctatgc cgagggttca gctcacggcc gataaaccaa tcaaagacaa 60
cgtttcctga gtttcctcca acggccagga ttatctcgtg agttcccaga ccgttcggct 120
tgcgtgtggg cacgaacgag cccacgtaga caaacaggct caaagccaac gaaaactcgt 180
acgcagtcac catcaattcc agaaagttct cgtggatgaa cgacagctca ggaaggttga 240
actttgtgag ataagctctg ctggcaagaa ttcccacgag aagagtgctc aattctttcc 300
cgttgacgag atagttgagc tttgttccgt ctcgtaacag gactccctct ttatggtagc 360
caggcatcac aagatccacc aacgtcagag tgaagaacca caccaggtaa accttccagc 420
acgtgacatt taacacaaga tcccgccagt tgccgactat cttggactcg aaaagcgttt 480
tcagcgtggc aaaatcgatg cttgcgcctt caaccacata ctcctcatta cagcaaaagt 540
agaggaaaag gaccactgaa gggagaaata ctgacaaaac gaccgctccc ggtgtcccgc 600
agaaatcttt atgcgtagtc ttggggttca attcagacat ggtagattgg tgagggtaat 660
tgtgaagagg attcgataaa gagaggggaa cagcaccgga gatagttctt agatcaaaat 720
gtttttctga ccttttttgc tctttctcgt ttagctcgcg tacagtcgac gcgtcggttt 780
gcgtcgaaaa gagtcaagcc gcgatcgcga ttaaaaatga atccggagaa gtcaaaaata 840
tgtaatttaa accatcacag tatataagta ggcgggaagc gcacaatttc taggcattcc 900
acagatcagc taaccaggac attccactgg agccaacaat gtcactcaca acaaccaacc 960
tcgacggcat cttgccgcta attgccaagg gcaaagtcag agacatctat caagttgacg 1020
aggaaagcct gctgttcgtg gcaacagacc ggatttccgc ctacgatgtg atcatggaga 1080
atggaatcaa agacaagggt aaaatactga ctcagctgtc agtattctgg tttgatttgc 1140
tgaaagacac tatcaagaac caccttatcg catccactga cgacgaagtg tttgccagac 1200
ttccacagga gctgtctcag ccaaagtaca agtcgcagct gagtggaaga gcactggtgg 1260
tgagaaagca caaattgatc cccctggagg tgattgtcag aggctacatc accggaagtg 1320
catggaagga gtacaacaag agcaagaccg tgcacggtct cgaggttggc gcagagctga 1380
aggagagtca agagttcccc gttccgattt tcaccccgtc aacgaaagct gaacaaggcg 1440
aacacgacga aaacatttcc cccgagaaag ctgcagagat tgtcggggaa caactgtgtg 1500
cgcggctcgc agaaaaggct gtgcagctgt actccaaggc cagaacttac gccaaaagca 1560
agggtatcat tctcgccgac acaaagtttg agtttggaat tgacgagaac gacgaattgg 1620
ttcttgtgga cgaggttttg acccctgatt cctcgagatt ttgggacgca aagacttaca 1680
agatcggaca gtcgcaggac tcttacgaca aacagtttct gagagactgg ctcacgtcca 1740
acggtctgaa cgggaaagac ggtgtctcta tgaccgcgga gatcgctgaa cgcacgggtg 1800
cgaagtacgt cgaggcattt gagtctctga cgggaagaaa gtggacgtag tttttgataa 1860
tagtaaccct ggaaatttga tatgtggcgg tgtagtctgt ggcggtggaa taaaatctaa 1920
attgaattta gtcgcttccc aaaacagcaa tttgtcaaca cttagtctgt gcacagcctt 1980
gacggcattt gagccatccc agggtctggc agttacaggg ctttgatcaa aagaaaactg 2040
gtgaagtttg acaacaggct acagctgcca agtcgcaact tgggtagtag ctcattcgtc 2100
gaacaccagt gcgccatgtc catcgccaac gagttccagc ccttggagct tattggtagg 2160
ggttcctttg gatgtgttcg gaaagtgcgc cgcaagtcgg acggcaagat atttgtgaga 2220
aaggagatct cctacatcgc catgaacacc aaagagaagc agcagctcac agcagagttt 2280
cgtattctca gagaactaaa gcatcccaac attgtccatt atgtccacca cgaccacgtc 2340
caggaggaac agaccgtcca tctgtacatg gaatactgcg atgggggcga cttgtcggtg 2400
ttgatcagga agtacaaagg aaagaacgag tttatcccgg agaacttgat ctggcaaatc 2460
ttcacccagg ttctcaacgc tctctatcaa tgccactatg gggtcaatat tgaggctgtg 2520
caagaacttt tccagtccac tccagagatt gcaccccggg 2560



<210> 28 
<211> 303 
<212> PRT 

<213> Ogataea minuta 
<400> 28 

Met Ser Leu Thr Thr Thr Asn Leu Asp Gly Ile Leu Pro Leu Ile Ala
1 5 10 15

Lys Gly Lys Val Arg Asp Ile Tyr Gln Val Asp Glu Glu Ser Leu Leu
20 25 30

Phe Val Ala Thr Asp Arg Ile Ser Ala Tyr Asp Val Ile Met Glu Asn
35 40 45

Gly Ile Lys Asp Lys Gly Lys Ile Leu Thr Gln Leu Ser Val Phe Trp
50 55 60

Phe Asp Leu Leu Lys Asp Thr Ile Lys Asn His Leu Ile Ala Ser Thr
65 70 75 80

Asp Asp Glu Val Phe Ala Arg Leu Pro Gln Glu Leu Ser Gln Pro Lys
85 90 95

Tyr Lys Ser Gln Leu Ser Gly Arg Ala Leu Val Val Arg Lys His Lys
100 105 110

Leu Ile Pro Leu Glu Val Ile Val Arg Gly Tyr Ile Thr Gly Ser Ala
115 120 125

Trp Lys Glu Tyr Asn Lys Ser Lys Thr Val His Gly Leu Glu Val Gly
130 135 140

Ala Glu Leu Lys Glu Ser Gln Glu Phe Pro Val Pro Ile Phe Thr Pro
145 150 155 160

Ser Thr Lys Ala Glu Gln Gly Glu His Asp Glu Asn Ile Ser Pro Glu
165 170 175

Lys Ala Ala Glu Ile Val Gly Glu Gln Leu Cys Ala Arg Leu Ala Glu
180 185 190

Lys Ala Val Gln Leu Tyr Ser Lys Ala Arg Thr Tyr Ala Lys Ser Lys
195 200 205

Gly Ile Ile Leu Ala Asp Thr Lys Phe Glu Phe Gly Ile Asp Glu Asn
210 215 220

Asp Glu Leu Val Leu Val Asp Glu Val Leu Thr Pro Asp Ser Ser Arg
225 230 235 240

Phe Trp Asp Ala Lys Thr Tyr Lys Ile Gly Gln Ser Gln Asp Ser Tyr
245 250 255

Asp Lys Gln Phe Leu Arg Asp Trp Leu Thr Ser Asn Gly Leu Asn Gly
260 265 270

Lys Asp Gly Val Ser Met Thr Ala Glu Ile Ala Glu Arg Thr Gly Ala
275 280 285

Lys Tyr Val Glu Ala Phe Glu Ser Leu Thr Gly Arg Lys Trp Thr
290 295 300

<210> 29 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 5'-primer for amplification
upstream region of URA3 structural gene
<400> 29 

ccccgagctc aaaaaaaagg taccaatttc agctccgacg ccggagccca ctacgcctac 60 

<210> 30
<211> 38
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: 3'-primer for amplification of
upstream region of URA3 structural gene

<400> 30

gggaagcttc cccagttgta caccaatctt gtcgacag 38

<210> 31
<211> 50
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer Dad1-5 used for destruction
of Ogataea minuta ADE1 gene

<400> 31

aaaaagcggc cgctcccggt gtcccgcaga aatctttatg cgtagtcttg 50

<210> 32
<211> 56
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer Dad1-3 used for destruction
of Ogataea minuta ADE1 gene

<400> 32 

cccccggatc ctttttttta agcttgttgt actccttcca tgcacttccg gtgatg 56 

<210> 33 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer Dad2-5 used for destruction 
of Ogataea minuta ADE1 gene

<400> 33 

ttttcacccc gtcaaggatc cctgaacaag gcgaacacga cgaaaacatt tcccccgag 59 

<210> 34 

<211> 44 

<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: primer Dad2-3 used for destruction 
of Ogataea minuta ADE1 gene 

<400> 34 

tttttgggcc cacctgggtg aagatttgcc agatcaagtt ctcc 44 

<210> 35 

<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer DA5 used for confirmation 
of destruction of Ogataea minuta ADE1 gene 

<400> 35 

gatgcttgcg ccttcaacca catactcctc 30 

<210> 36 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer DA3 used for confirmation 
of destruction of Ogataea minuta ADE1 gene 



<400> 36 
aaaagttctt gcacagcctc aatattgacc 30



<210> 37
<211> 30
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer D0U5 used for confirmation
of destruction of Ogataea minuta ADE1 gene

<400> 37

atcgatttcg agtgtttgtc caggtccggg 30

<210> 38
<211> 10
<212> PRT

<213> Saccharomyces cerevisiae
<220>

<221> variation
<222> 3

<223> Xaa=His or Arg
<220>

<221> variation
<222> 4

<223> Xaa=Ile or Val

<400> 38

Pro Gln Xaa Xaa Trp Gln Thr Trp Lys Val
1 5 10



<210> 39 
<211> 11 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 39 

Trp Tyr Ala Arg Arg Ile Gln Phe Cys Gln Trp
1 5 10 

<210> 40 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer P0H5 for amplification
5'-region of Ogataea minuta OCH1 gene

<400> 40

ccncarcryr thtggcarac ntggaargt 29



<210> 41
<211> 33
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer P0H3 for amplification
3'-region of Ogataea minuta OCH1 gene

<400> 41

ccaytgrcar aaytgdatnc knckngcrta cca 33

<210> 42
<211> 2527
<212> DNA

<213> Ogataea minuta

<220>
<221> CDS
<222> 508..1812

<400> 42

agatctgttg acactggtca agcgtgtagc caagagaata ggaaacggaa tttcatactg 60
ccggcacaca aggacaataa gggcgtccgg ggctgtcgaa attgtcgaga ccgtagagct 120
attgttacct caataagttg ctctacgatt gtttccgtct ttgacaaagc agtaggcctt 180
tctcaaggtg gtgtacgggt gtttcatttt taatttgcat cgagaacgcg tagtgcgcca 240
atggatctgc agggggctcg gctgattgca ctgaaatttc agcaataaat agctgaggat 300
attcaggcac aacggtacca acggggcagg cttgatcgcg aagcagcagg agaaggcagc 360
gaagtgactg aagagacgag aaggagacga atcagcctac ccctggaacc ataaacaaag 420
tcgagccgtt tttttaggga cagaaaccgt tctggatatt tattcgacgc agagactcgg 480
tagtcatctc tacgttcagc acacaccatg aactatcacg acttgtacga tgatagcaaa 540
cggcagtcgt tgatgcgaaa ggcgcgaaag ttcgctgaga tgaacaagaa gttggtggtg 600
gtggtcattt taacgatgta cgttgtgtcg cgtctggcgt cggttggaag cacgaaacag 660
gagtcgattc caggactcac catgaaagag tcagagttag aggtgaattt taaaacattt 720
ggaatggatc tgcagaagcg gaacgagcta ccggccgcaa gtgcaacgct gagagaaaaa 780 
ctatcgtttt acttccccta tgaccctgaa aaaccagtgc ccaaccaaat atggcagacg 840 
tggaaagtgg acatcaacga caaatcattc ccgagacact tccgtaagtt ccaagagaca 900 
tggccacaac taaacagcgg gtacacgtac catctcattc cagacagtat tgtggacgag 960 
ttcatgagga gtctttttgc caatgtccct gaggttattg cagcctacaa catgttaccg 1020 
aaaaatatcc tcaaggcgga ttttttccgg tatttggtga tttttgcgcg cggtggaact 1080 
tattcggata tcgacacgat ctgcctcaaa ccagtgaacg aatgggccac gtttaacgaa 1140 
caaactgtca tttcgcacta tctcaagacc aacggtaaaa cctcgcagtt gccagaagtg 1200 
gacccctcca cgcgcaaaac accgatcgga ctcaccattg gaatagaggc cgacccagac 1260 
agacccgact ggcacgaatg gtacgctaga cgtattcagt tctgtcaatg gacgatccag 1320 
ggcaagcaag gccatcccat gctgcgcgag ttgatcatcc gtatagtgga gcaaactttc 1380
cgcaaagagg ccatgggcaa tttgaaaaaa gtagagggga aggatatggg tggtgacatc 1440 
atgcagtgga caggacccgg ggttttcaca gataccctgt ttgattatct caataacgtg 1500 
gtgagtgacg gaaagctggg agacggttac ggagtcgggt ccaagtactg gaacagtcac 1560 
gccaagtaca agctgtctca cattgaggtg gatgccaaca acgagccgat gcactctgac 1620 
aagcaaacta tcagctggaa gtccatgagt aagctatcgg agcccctgat tatagatgac 1680 
gtgatgatcc tgccaatcac tagcttcagc cccggcgtgg gccagatggg ctcgcattcg 1740 
cccgaccacc cgctcgcatt tgtccggcac atgttccagg gcagctggaa accagatgca 1800 
gagaagatgt gactgcatat aggaacgcat tttatacagt agatcaagtt aaaagtttga 1860 
acttttgcgg ggaagtggtg taagggtgtt tgacgagggc ctgaacccgt gagtcaacgc 1920 
gcttggacgg aagaacgggt gcacgccgca tggggctgtt cgttcagttt tgacgctgct 1980 
aacgagagag tagcttgcag attgcaatcc cgactgagtc cacccggttg agctagtcac 2040 
acgactgcgt cttttctttc tggtgtacgg gtgtcaatac attttcggtt taaaaacgat 2100 
aagatgcaac aaggtatctt ctgtagctaa accccacttc tccagacacc ttccaccagc 2160 
cgatgactat gacagacagg tttttggagg attacaagaa gtttctcccc aaagcgcacg 2220 
atttgagggg cacgcactca cggcttttca cgacggcggg cggggccgat gcggggagtt 2280 
tggctgattg gagagagtgg acagatgatt tgggtcattc gcaggagtat tacgagctga 2340 
aacaggagat caattgtctt gttcttaact accttatcta cgaaggatat gttggtgctg 2400 






ttcgagagtt ttcgaaagag ctgggattcg attttatcgt ggaggagttg gaaggaattg 2460 
aagaggagaa gggaggccac caagaggacg gagagtacac gaccatgtca gacactgacg 2520 
tactagt 2527 

<210> 43 
<211> 434
<212> PRT 

<213> Ogataea minuta 
<400> 43

Met Asn Tyr His Asp Leu Tyr Asp Asp Ser Lys Arg Gln Ser Leu Met
1 5 10 15

Arg Lys Ala Arg Lys Phe Ala Glu Met Asn Lys Lys Leu Val Val Val
20 25 30

Val Ile Leu Thr Met Tyr Val Val Ser Arg Leu Ala Ser Val Gly Ser
35 40 45

Thr Lys Gln Glu Ser Ile Pro Gly Leu Thr Met Lys Glu Ser Glu Leu
50 55 60

Glu Val Asn Phe Lys Thr Phe Gly Met Asp Leu Gln Lys Arg Asn Glu
65 70 75 80

Leu Pro Ala Ala Ser Ala Thr Leu Arg Glu Lys Leu Ser Phe Tyr Phe
85 90 95

Pro Tyr Asp Pro Glu Lys Pro Val Pro Asn Gln Ile Trp Gln Thr Trp
100 105 110

Lys Val Asp Ile Asn Asp Lys Ser Phe Pro Arg His Phe Arg Lys Phe
115 120 125

Gln Glu Thr Trp Pro Gln Leu Asn Ser Gly Tyr Thr Tyr His Leu Ile
130 135 140

Pro Asp Ser Ile Val Asp Glu Phe Met Arg Ser Leu Phe Ala Asn Val
145 150 155 160

Pro Glu Val Ile Ala Ala Tyr Asn Met Leu Pro Lys Asn Ile Leu Lys
165 170 175

Ala Asp Phe Phe Arg Tyr Leu Val Ile Phe Ala Arg Gly Gly Thr Tyr
180 185 190

Ser Asp Ile Asp Thr Ile Cys Leu Lys Pro Val Asn Glu Trp Ala Thr
195 200 205

Phe Asn Glu Gln Thr Val Ile Ser His Tyr Leu Lys Thr Asn Gly Lys
210 215 220

Thr Ser Gln Leu Pro Glu Val Asp Pro Ser Thr Arg Lys Thr Pro Ile
225 230 235 240

Gly Leu Thr Ile Gly Ile Glu Ala Asp Pro Asp Arg Pro Asp Trp His
245 250 255

Glu Trp Tyr Ala Arg Arg Ile Gln Phe Cys Gln Trp Thr Ile Gln Gly
260 265 270

Lys Gln Gly His Pro Met Leu Arg Glu Leu Ile Ile Arg Ile Val Glu
275 280 285

Gln Thr Phe Arg Lys Glu Ala Met Gly Asn Leu Lys Lys Val Glu Gly
290 295 300

Lys Asp Met Gly Gly Asp Ile Met Gln Trp Thr Gly Pro Gly Val Phe
305 310 315 320

Thr Asp Thr Leu Phe Asp Tyr Leu Asn Asn Val Val Ser Asp Gly Lys
325 330 335

Leu Gly Asp Gly Tyr Gly Val Gly Ser Lys Tyr Trp Asn Ser His Ala
340 345 350

Lys Tyr Lys Leu Ser His Ile Glu Val Asp Ala Asn Asn Glu Pro Met
355 360 365

His Ser Asp Lys Gln Thr Ile Ser Trp Lys Ser Met Ser Lys Leu Ser
370 375 380

Glu Pro Leu Ile Ile Asp Asp Val Met Ile Leu Pro Ile Thr Ser Phe
385 390 395 400

Ser Pro Gly Val Gly Gln Met Gly Ser His Ser Pro Asp His Pro Leu
405 410 415

Ala Phe Val Arg His Met Phe Gln Gly Ser Trp Lys Pro Asp Ala Glu
420 425 430

Lys Met

<210> 44 
<211> 30 

<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: primer D03 used for confirmation
of destruction of Ogataea minuta OCH1 gene

<400> 44 

ccattgtcag ctccaattct ttgataaacg 30 

<210> 45 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer D05 used for confirmation
of destruction of Ogataea minuta OCH1 gene

<400> 45

acacttccgt aagttccaag agacatggcc 30

<210> 46 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer D03-2 used for confirmation
of destruction of Ogataea minuta OCH1 gene

<400> 46

tcaccacgtt attgagataa tcaaacaggg 30

<210> 47 
<211> 8 
<212> PRT 

<213> Saccharomyces cerevisiae 


<400> 47 

Thr Asn Tyr Leu Asn Ala Gln Tyr
1 5 

<210> 48 

<211> 8 

<212> PRT 

<213> Saccharomyces cerevisiae 



<400> 48

Lys Ala Tyr Trp Glu Val Lys Phe
1 5



<210> 49
<211> 23
<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: primer PPA5 for amplification of
5'-region of Ogataea minuta PEP4 gene

<400> 49

acnaaytayy tnaaygcnca rta 23

<210> 50
<211> 23
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer PPA3 for amplification of
3'-region of Ogataea minuta PEP4 gene

<400> 50

aayttnacyt cccartangc ytt 23



<210> 51 
<211> 1951 
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<212> DNA 

<213> Ogataea minuta

<220>
<221> CDS
<222> 477..1709

<400> 51 

catatgtatt catcaatcta cagcttttct aatcngtgtg acttcagtca catgatcctc 60 
tgacccgcca cgaccttgct ggcttccagc gcgcgaaact cactcccaat tttcggatta 120 
gctaatcacg aagatttttg gatttcctga tctgtagtgt atccatcctg ccttaatcgt 180 
tttcgataca tttgttatcc gaattgggaa tggcattagt cgtgcgccac ccgactcgcc 240 
acccccattc tagtggcaaa caggattgaa agagggctaa aaggtaactt agtgttttat 300 
ctctgaatct tccttctgat atcaatcaac aattgttaaa cgattgaaag ttttgaaaca 360 
ttcattgaac ttgcgaagcg ctcacacagc atcgttcggt tagcagttac aacagtttag 420
gtttttttcc ccacaaaaag gctcacgctg cctcctcact cttgcctctt ttcttgatga 480 
aactctcgct tgcattgctc gcccttggtg gtttccaaga ggcccacgcc aaggttcatc 540 
atgcgccaat caagaagact cctgccgcgg aaacttacaa ggacgtgagt ttcggcgact 600 
acgtggattc tctgaagggc aagtatgtct ctatgtttgc taagcatgct gcggagtcct 660 
cccaaaacgc ctttgtccct tttgttcagg aagtgcaaga cccagagttt actgttcagg 720 
agggacacaa ctcccctctc acgaactacg tgaacgctca gtacttcact gagattcaaa 780 
ttggtacccc gggccaaccg ttcaaggtca tcctcgacac tggttcgtcc aatttgtggg 840 
ttccaggctc ggattgttct tctcttgctt gctacctgca tcagaagtac gaccacgact 900 
cttcgtcaac ctacaaggcc aacggctctg aatttgctat cagatacggc tctggttcgc 960 
tggagggttt tgtctcccag gacaccctga ctcttggtga cctcatcatt ccaaagcaag 1020 
actttgccga ggccaccagt gagccaggtc tcgcatttgc ctttggtaag tttgacggta 1080
ttctcggact tgcgtacgac accatctcgg tggacaagat tgttcctcct atctacaacg 1140
ctttgaacct ggggcttttg gacgagcctc agttcgcctt ctacctcgga gacactgcca 1200
agtctgaggc agacggtgga gtggctactt tcggaggtgt tgacgaaact aagtacgacg 1260
gaaagatcac ttggttgcca gtgagaagaa aggcttactg ggaggtgaag tttgacggta 1320
tcgctcttgg tgacgagtac gcgactttag acggatatgg cgctgccatc gacacaggta 1380
cctctttaat tgctttgcct tcccaattgg ctgagatttt gaactctcaa atcggtgccg 1440
agaagtcctg gtccggccag tacaccattg actgtgaaaa gagagcatct ttgccagacc 1500
tcactttcaa ctttgacggt tacaatttct ctatctccgc gtacgactac actcttgagg 1560
tttcaggctc gtgcatttcc gccttcactc cgatggactt ccctgcccca attggccctc 1620
tcgccatcat tggtgatgct ttcctgagaa agtattactc cgtgtacgac ttgggcaagg 1680
acgctgttgg attggctaag gccgtttaat ctctagcctt ctagttattg attgctattg 1740
ttaattctgc catcctggat tggcatgaat ggttggttgg tacgcatata cggttggcgg 1800
tggtatgttt attgctttta ttacgtgacc aaatgttggt ttttctttca ccttttactc 1860
tgcactactt cactctttca ttggctttgg aagtacgtta tttttttcac cctatgtaac 1920
tgaattgcac aaatttaaag attgctctag a 1951



<210> 52 
<211> 410 
<212> PRT 

<213> Ogataea minuta 
<400> 52 

Met Lys Leu Ser Leu Ala Leu Leu Ala Leu Gly Gly Phe Gln Glu Ala
1 5 10 15

His Ala Lys Val His His Ala Pro Ile Lys Lys Thr Pro Ala Ala Glu
20 25 30

Thr Tyr Lys Asp Val Ser Phe Gly Asp Tyr Val Asp Ser Leu Lys Gly
35 40 45

Lys Tyr Val Ser Met Phe Ala Lys His Ala Ala Glu Ser Ser Gln Asn
50 55 60

Ala Phe Val Pro Phe Val Gln Glu Val Gln Asp Pro Glu Phe Thr Val
65 70 75 80

Gln Glu Gly His Asn Ser Pro Leu Thr Asn Tyr Val Asn Ala Gln Tyr
85 90 95

Phe Thr Glu Ile Gln Ile Gly Thr Pro Gly Gln Pro Phe Lys Val Ile
100 105 110

Leu Asp Thr Gly Ser Ser Asn Leu Trp Val Pro Gly Ser Asp Cys Ser
115 120 125

Ser Leu Ala Cys Tyr Leu His Gln Lys Tyr Asp His Asp Ser Ser Ser
130 135 140

Thr Tyr Lys Ala Asn Gly Ser Glu Phe Ala Ile Arg Tyr Gly Ser Gly
145 150 155 160

Ser Leu Glu Gly Phe Val Ser Gln Asp Thr Leu Thr Leu Gly Asp Leu
165 170 175

Ile Ile Pro Lys Gln Asp Phe Ala Glu Ala Thr Ser Glu Pro Gly Leu
180 185 190

Ala Phe Ala Phe Gly Lys Phe Asp Gly Ile Leu Gly Leu Ala Tyr Asp
195 200 205

Thr Ile Ser Val Asp Lys Ile Val Pro Pro Ile Tyr Asn Ala Leu Asn
210 215 220

Leu Gly Leu Leu Asp Glu Pro Gln Phe Ala Phe Tyr Leu Gly Asp Thr
225 230 235 240

Ala Lys Ser Glu Ala Asp Gly Gly Val Ala Thr Phe Gly Gly Val Asp
245 250 255

Glu Thr Lys Tyr Asp Gly Lys Ile Thr Trp Leu Pro Val Arg Arg Lys
260 265 270

Ala Tyr Trp Glu Val Lys Phe Asp Gly Ile Ala Leu Gly Asp Glu Tyr
275 280 285

Ala Thr Leu Asp Gly Tyr Gly Ala Ala Ile Asp Thr Gly Thr Ser Leu
290 295 300

Ile Ala Leu Pro Ser Gln Leu Ala Glu Ile Leu Asn Ser Gln Ile Gly
305 310 315 320

Ala Glu Lys Ser Trp Ser Gly Gln Tyr Thr Ile Asp Cys Glu Lys Arg
325 330 335

Ala Ser Leu Pro Asp Leu Thr Phe Asn Phe Asp Gly Tyr Asn Phe Ser
340 345 350

Ile Ser Ala Tyr Asp Tyr Thr Leu Glu Val Ser Gly Ser Cys Ile Ser
355 360 365

Ala Phe Thr Pro Met Asp Phe Pro Ala Pro Ile Gly Pro Leu Ala Ile
370 375 380

Ile Gly Asp Ala Phe Leu Arg Lys Tyr Tyr Ser Val Tyr Asp Leu Gly
385 390 395 400

Lys Asp Ala Val Gly Leu Ala Lys Ala Val
405 410

<210> 53 
<211> 11 
<212> PRT 

<213> Saccharomyces cerevisiae 
<220> 

<221> variation 
<222> 2 

<223> Xaa=Gly or Leu 
<400> 53 

Asp Xaa Asn Gly His Gly Thr His Cys Ala Gly 
1 5 10

<210> 54 
<211> 11 
<212> PRT 

<213> Saccharomyces cerevisiae 
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<220> 

<221> variation 
<222> 6 

<223> Xaa=Ser or Thr 
<220> 

<221> variation 
<222> 9 

<223> Xaa=Val or Ile
<220> 

<221> variation 
<222> 10 

<223> Xaa=Ala or Val 
<400> 54 

Gly Thr Ser Met Ala Xaa Pro His Xaa Xaa Gly 
1 5 10 

<210> 55
<211> 32
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer PPB5 for amplification
5'-region of Ogataea minuta PRB1 gene

<400> 55

gaybknaayg gncayggnac ncaytgykcn gg 32

<210> 56
<211> 32
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer PPB3 for amplification
3'-region of Ogataea minuta PRB1 gene

<400> 56

ccnrcnayrt gnggnwsngc catnwsngtn cc 32

<210> 57
<211> 2214
<212> DNA

<213> Ogataea minuta

<220>
<221> CDS
<222> 394..2013

<400> 57

ggatcccctc tctcgctagc gagtttcgcc tgctctgcga taagagaaaa ccggctgtgc 60
agctttcacc ccaacacgtc actttctgca gtcgtgcgcc ggcttgcatt aggtcgtgcg 120
cagatcccaa atttgccacc agactaaatt ggggcattct ggtgagggaa taggggaaat 180
aagagggtgt tttgacgttt catatacatt gctctttctt ttcttggacg gttagcggta 240
ttgccataga ttatcttgcg cagttcagca tccttaggag ttattctttc ttgtaggtct 300
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tttttcagaa cagaaaaatc gccaatcaca gaaagattca gtcctaattg aagccttatc 360 
ttatcttatc tcacctcaac cacttgaacc aaaatgaagt tatcccagtc tgctgcggtg 420 
gctattctgt cttcgttggc agcagtggag gccttggtca tcccgttatt tgacgacttg 480 
ccagcagagt ttgcccttgt tccaatggat gcgaaagcgg aagtcatttc tgacgttcct 540 
gtcgactcgg ccattagtga tgctcctatc gcggcactaa atgatgctcc aagccctctc 600 
gtcacatcgc tgatcgcatc tcaaaatttg attccaaact cttatattgt cgttttcaag 660 
aatggcctag cttccggggc agttgacttc cacatggagt ggctcaagga aacgcactcc 720 
caaaccctgg ctgctttgtc taaggacatg ccagcagaag aattggccgc cgaaggtttc 780 
gtttccgaaa gcattgatct tactgaggtg tttagcatct ccgatttgtt cagtggatat 840 
accggatact tcccggagaa ggtggttgac ctcatcagaa gacaccctga cgtggcgttc 900
gttgagcagg actcgagagt tttcgccgat aagtcgtcta ctcaaaacgg tgctccttgg 960 
ggtttgtcta gaatctctca cagagagcct ctcagtctcg gcaatttcaa cgagtacgtt 1020 
tacgacgatc ttgctggaga tggcgtcacg gcttatgtca ttgataccgg tatcaatgtg 1080 
aagcacgagc agttcggtgg cagagcagag tggggtaaga ccatcccaac cggtgatgat 1140 
gatattgacg gaaacggtca cggtactcac tgcgctggta caattggctc ggaagattat 1200 
ggagtttcta agaactccaa aattgtcgca gtgaaggttt tgagatctaa cggttctggt 1260 
tccatgtctg acgtgatcaa gggtgttgaa ttcgctgcaa atgatcacgt tgccaagtct 1320 
aaagccaaga aggacggttt caagggatcg actgccaaca tgtctttggg aggtggcaag 1380 
tctcctgctc ttgacttggc tgtcaatgcc gctgtcaaag ctggtttaca ctttgctgtt 1440 
gccgctggta acgacaatgc tgacgcatgc aactattctc ctgctgctgc agagaacgca 1500 
gtcactgttg gtgcgtccac tttgtctgac tctagagctt acttttccaa ctatggtaaa 1560 
tgtgttgaca tttttgctcc gggcttgaac atcctttcca cctacatagg ttctgacact 1620 
gccaccgcca ctctttctgg tacatcgatg gcctcccctc acgtttgtgg tctgttgacc 1680 
tactttttga gcttgcaacc agaatcgtcg tcgttgtttt cttcggcagc tatctcccct 1740 
gctcagctga agaagaacct gatcaagttt ggtacgaaga acgttttgtc tgagattcca 1800 
tcggacggaa ccccaaatat tctcatttac aacggtgctg gcaagaacat cagtgacttc 1860 
tgggcgtttg aagacgaggc ctcggccaag tccgacttga agaaggctgt cgatattgcc 1920 
acaagtgttg acttagacct gcaagatatc aaggagaagt tcaaccatat tttggaggag 1980 
gtcgccgaag aggttgctga tttgttcgat taggtttcta acaattcagt gatcttgtct 2040 
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ttactgtggt ttcggaaact gggtttagac agcggtcctg ttactcatat tgcgcttgat 2100 
cgcttttcct tttttttctg ttgtttggag tgtttgtttt tctggataat gtggttagtt 2160 
tttcaagttg cttccaatat tgtttgtcca gattagagtc attgcttgaa gctt 2214 

<210> 58 
<211> 539 
<212> PRT 

<213> Ogataea minuta 

<400> 58

Met Lys Leu Ser Gln Ser Ala Ala Val Ala Ile Leu Ser Ser Leu Ala
1 5 10 15

Ala Val Glu Ala Leu Val Ile Pro Leu Phe Asp Asp Leu Pro Ala Glu
20 25 30

Phe Ala Leu Val Pro Met Asp Ala Lys Ala Glu Val Ile Ser Asp Val
35 40 45

Pro Val Asp Ser Ala Ile Ser Asp Ala Pro Ile Ala Ala Leu Asn Asp
50 55 60

Ala Pro Ser Pro Leu Val Thr Ser Leu Ile Ala Ser Gln Asn Leu Ile
65 70 75 80

Pro Asn Ser Tyr Ile Val Val Phe Lys Asn Gly Leu Ala Ser Gly Ala
85 90 95

Val Asp Phe His Met Glu Trp Leu Lys Glu Thr His Ser Gln Thr Leu
100 105 110

Ala Ala Leu Ser Lys Asp Met Pro Ala Glu Glu Leu Ala Ala Glu Gly
115 120 125

Phe Val Ser Glu Ser Ile Asp Leu Thr Glu Val Phe Ser Ile Ser Asp
130 135 140

Leu Phe Ser Gly Tyr Thr Gly Tyr Phe Pro Glu Lys Val Val Asp Leu
145 150 155 160

Ile Arg Arg His Pro Asp Val Ala Phe Val Glu Gln Asp Ser Arg Val
165 170 175

Phe Ala Asp Lys Ser Ser Thr Gln Asn Gly Ala Pro Trp Gly Leu Ser
180 185 190

Arg Ile Ser His Arg Glu Pro Leu Ser Leu Gly Asn Phe Asn Glu Tyr
195 200 205

Val Tyr Asp Asp Leu Ala Gly Asp Gly Val Thr Ala Tyr Val Ile Asp
210 215 220

Thr Gly Ile Asn Val Lys His Glu Gln Phe Gly Gly Arg Ala Glu Trp
225 230 235 240

Gly Lys Thr Ile Pro Thr Gly Asp Asp Asp Ile Asp Gly Asn Gly His
245 250 255

Gly Thr His Cys Ala Gly Thr Ile Gly Ser Glu Asp Tyr Gly Val Ser
260 265 270

Lys Asn Ser Lys Ile Val Ala Val Lys Val Leu Arg Ser Asn Gly Ser
275 280 285

Gly Ser Met Ser Asp Val Ile Lys Gly Val Glu Phe Ala Ala Asn Asp
290 295 300

His Val Ala Lys Ser Lys Ala Lys Lys Asp Gly Phe Lys Gly Ser Thr
305 310 315 320

Ala Asn Met Ser Leu Gly Gly Gly Lys Ser Pro Ala Leu Asp Leu Ala
325 330 335

Val Asn Ala Ala Val Lys Ala Gly Leu His Phe Ala Val Ala Ala Gly
340 345 350

Asn Asp Asn Ala Asp Ala Cys Asn Tyr Ser Pro Ala Ala Ala Glu Asn
355 360 365

Ala Val Thr Val Gly Ala Ser Thr Leu Ser Asp Ser Arg Ala Tyr Phe
370 375 380

Ser Asn Tyr Gly Lys Cys Val Asp Ile Phe Ala Pro Gly Leu Asn Ile
385 390 395 400

Leu Ser Thr Tyr Ile Gly Ser Asp Thr Ala Thr Ala Thr Leu Ser Gly
405 410 415

Thr Ser Met Ala Ser Pro His Val Cys Gly Leu Leu Thr Tyr Phe Leu
420 425 430

Ser Leu Gln Pro Glu Ser Ser Ser Leu Phe Ser Ser Ala Ala Ile Ser
435 440 445

Pro Ala Gln Leu Lys Lys Asn Leu Ile Lys Phe Gly Thr Lys Asn Val
450 455 460

Leu Ser Glu Ile Pro Ser Asp Gly Thr Pro Asn Ile Leu Ile Tyr Asn
465 470 475 480

Gly Ala Gly Lys Asn Ile Ser Asp Phe Trp Ala Phe Glu Asp Glu Ala
485 490 495

Ser Ala Lys Ser Asp Leu Lys Lys Ala Val Asp Ile Ala Thr Ser Val
500 505 510

Asp Leu Asp Leu Gln Asp Ile Lys Glu Lys Phe Asn His Ile Leu Glu
515 520 525

Glu Val Ala Glu Glu Val Ala Asp Leu Phe Asp
530 535

<210> 59
<211> 9
<212> PRT

<213> Saccharomyces cerevisiae
<220>

<221> variation
<222> 1

<223> Xaa=His or Asn
<220>

<221> variation
<222> 5

<223> Xaa=Val or Thr
<400> 59

Xaa Tyr Asp Trp Xaa Phe Leu Asn Asp
1 5

<210> 60 
<211> 12 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 60 

Tyr Asn Leu Cys His Phe Trp Ser Asn Phe Glu Ile
1 5 10 

<210> 61 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: primer PKR5 for amplification of 
5' -region of Ogataea minuta KTR1 gene 

<400> 61 

maytaygayt ggrynttyyt naayga 26 

<210> 62 
<211> 35 

<212> DNA
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer PKR3 for amplification of 
3' -region of Ogataea minuta KTR1 gene 

<400> 62 

atytcraart tnswccaraa rtgrcanarr ttrta 35 


<210> 63 
<211> 1930 
<212> DNA 

<213> Ogataea minuta 

<220> 
<221> CDS 
<222> 124..1335



<400> 63

gagctctata tttagctttg gacattggta ctagttggac tgttgatcgg ttgacttgac 60
agtgagttct atagaaagac aggctacaaa gaccaccaag gctggcaaat ttgcgagatt 120
acaatggcta gagcgaatgc gaggctgatc cggtttgcaa tctttgctac cgtgttggtt 180
ttatgtggat acattttatc caagggctcg tctacttcgt atacgatttc gacgccagag 240
tccggctcga gttccagtgg cactgttgct aatactgaga aatctgccct cgcagtgggt 300
gagaaaagcg ttgcaggcgc agccgagaaa agcgttcctg cagctgacgt cccagatgga 360
aaggtgaagg ccacttttgt ctctttggcc agaaaccagg atctgtggga gctggtgaac 420
tcgatcagac aggtcgaaga ccgtttcaac aacaagtatc attacgattg ggtgttcttg 480
aacgacgcgg aattcaacga cgagttcaag aaggtgacct ctcaggtctg ttcgggtaag 540
accaagtatg gtgtcattcc aaaggaacag tggagcttcc cttcgtggat cgacactgat 600
aaggctgctg ccaccagaga gcaaatgaga aaggacaaga tcatctacgg agactccatc 660
tcgtacagac acatgtgcag atacgagtcg ggattcttct tcaaacaccc agaactcgca 720
gagtacgagt actactggag agtggagcca agcatcaaga tctactgtga cattgactac 780
gacatcttca agttcatgaa ggacaacaag aagtcgtacg gatggaccat ttctcttcct 840
gagtacaagg agaccatccc aactctgtgg aagaccacta gagacttcat gaaggaaaac 900
ccacagtacg ttgcccagga caacctgatc aactttattt cggacgacgg aggaagcagc 960
tacaatggat gtcacttctg gtctaacttc gaggtcggct cgctcgagtt ctggagaggc 1020
gaagcctaca ccaagtactt tgaggcgttg gaccaggctg gtgggttctt ctacgaaaga 1080
tggggagatg cccctatcca ctcgattgcc gttgctctgt tcatgcctaa ggacgaggtt 1140
catttcttcg acgacgtcgg atacttccac aatccgttcc acaactgccc gatcgacaac 1200
gctgtcagag aggccaagaa ctgtgtctgc aaccaagccg acgacttcac cttccagcac 1260
tactcctgta cccctaagtt ttaccaggag atgggtttga aaaagcctgc taactgggag 1320
cagtacatcc attagttgac ccaggccacg ggttgatttc gcctggttgt tttttgtttt 1380
tacaagtctt tcaatactaa attagctgga ttcaagtgat acgagatgat tttcatctcc 1440
ggggtttctg taatttttgt ttcgagaaaa ataaatctac aaaaaaacgt gccagatact 1500
tgtctcccgg gggcaaacaa cgtgctctct ctgctactaa gtgttttgtt tctgtccaca 1560
acgcccgcag gtaaacgcaa tgtccgatac agattctgag tcagtctcga cgatcacaca 1620
gatgagcttc gagaacgttc tcgaggttct agaagactct gcgtctgagt gctccaagaa 1680
caaggacttc ctctccttct cgacgatcat cgacgtccat ctgggtgatc tttccattta 1740
cactgagtcc gagcgacttg agctgttgtc gaaactgaca tctattctga gcaatgacca 1800
ccaattggtt tacgaggtag gatgggactt accaccgatc atattcagct tcctggactc 1860
tgaatcttcg cccagtgagg ggctgatgaa cagcaaggtc acggttcttt tcttgaagct 1920
gtttgagctc 1930

<210> 64 

<211> 403 
<212> PRT

<213> Ogataea minuta

<400> 64

Met Ala Arg Ala Asn Ala Arg Leu Ile Arg Phe Ala Ile Phe Ala Thr
1 5 10 15

Val Leu Val Leu Cys Gly Tyr Ile Leu Ser Lys Gly Ser Ser Thr Ser
20 25 30

Tyr Thr Ile Ser Thr Pro Glu Ser Gly Ser Ser Ser Ser Gly Thr Val
35 40 45

Ala Asn Thr Glu Lys Ser Ala Leu Ala Val Gly Glu Lys Ser Val Ala
50 55 60

Gly Ala Ala Glu Lys Ser Val Pro Ala Ala Asp Val Pro Asp Gly Lys
65 70 75 80

Val Lys Ala Thr Phe Val Ser Leu Ala Arg Asn Gln Asp Leu Trp Glu
85 90 95

Leu Val Asn Ser Ile Arg Gln Val Glu Asp Arg Phe Asn Asn Lys Tyr
100 105 110

His Tyr Asp Trp Val Phe Leu Asn Asp Ala Glu Phe Asn Asp Glu Phe
115 120 125

Lys Lys Val Thr Ser Gln Val Cys Ser Gly Lys Thr Lys Tyr Gly Val
130 135 140

Ile Pro Lys Glu Gln Trp Ser Phe Pro Ser Trp Ile Asp Thr Asp Lys
145 150 155 160

Ala Ala Ala Thr Arg Glu Gln Met Arg Lys Asp Lys Ile Ile Tyr Gly
165 170 175

Asp Ser Ile Ser Tyr Arg His Met Cys Arg Tyr Glu Ser Gly Phe Phe
180 185 190

Phe Lys His Pro Glu Leu Ala Glu Tyr Glu Tyr Tyr Trp Arg Val Glu
195 200 205

Pro Ser Ile Lys Ile Tyr Cys Asp Ile Asp Tyr Asp Ile Phe Lys Phe
210 215 220

Met Lys Asp Asn Lys Lys Ser Tyr Gly Trp Thr Ile Ser Leu Pro Glu
225 230 235 240

Tyr Lys Glu Thr Ile Pro Thr Leu Trp Lys Thr Thr Arg Asp Phe Met
245 250 255

Lys Glu Asn Pro Gln Tyr Val Ala Gln Asp Asn Leu Ile Asn Phe Ile
260 265 270

Ser Asp Asp Gly Gly Ser Ser Tyr Asn Gly Cys His Phe Trp Ser Asn
275 280 285

Phe Glu Val Gly Ser Leu Glu Phe Trp Arg Gly Glu Ala Tyr Thr Lys
290 295 300

Tyr Phe Glu Ala Leu Asp Gln Ala Gly Gly Phe Phe Tyr Glu Arg Trp
305 310 315 320

Gly Asp Ala Pro Ile His Ser Ile Ala Val Ala Leu Phe Met Pro Lys
325 330 335

Asp Glu Val His Phe Phe Asp Asp Val Gly Tyr Phe His Asn Pro Phe
340 345 350

His Asn Cys Pro Ile Asp Asn Ala Val Arg Glu Ala Lys Asn Cys Val
355 360 365

Cys Asn Gln Ala Asp Asp Phe Thr Phe Gln His Tyr Ser Cys Thr Pro
370 375 380

Lys Phe Tyr Gln Glu Met Gly Leu Lys Lys Pro Ala Asn Trp Glu Gln
385 390 395 400

Tyr Ile His

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<210> 65 
<211> 10 
<212> PRT 

<213> Saccharomyces cerevisiae
<400> 65

Thr Ser Trp Val Leu Trp Leu Asp Ala Asp
1 5 10

<210> 66 

<211> 10 

<212> PRT 

<213> Saccharomyces cerevisiae 

<400> 66 

Glu Thr Glu Gly Phe Ala Lys Met Ala Lys 
1 5 10 

<210> 67
<211> 29
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer PMN5 for amplification
5'-region of Ogataea minuta MNN9 gene

<400> 67

acnwsntggg tnytntggyt ngaygcnga 29

<210> 68
<211> 29
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer PMN3 for amplification
3'-region of Ogataea minuta MNN9 gene

<400> 68

ttngccatyt tngcraancc ytcngtytc 29

<210> 69
<211> 2221
<212> DNA

<213> Ogataea minuta

<220>
<221> CDS
<222> 931..2034

<400> 69

gggcccagag gaggaggtag gagtgggggg atcgccaatt ttctggcgcg ctactccgcg 60
ctgccccgca cttcggccgc accgccaacc ttttctttgc gcaccccctc tccaacgttt 120
ccggcctttc cattgaagcg agctaagcag tcagagagca ccaccgggag acgtgatcgc 180
agcccctgtt tgccccgaaa ccgggttaga caaaacgcgg ttcggcttca acggactctc 240
atctttcaga tggccgcagg ttgctggcag cttcgggctg acaacatcat ggctgacgta 300
ggctagtcag tagggcaccc tgcgggttag taagtctccc tgcaggtcac cgttgcttga 360
gcatcgcagg agtgttaagc ggcagaaaag aggaggtgga gtggggacga gagatccggg 420
taaccgtagt cggcgcgcga gtccgagaag ttaatcgacg cgtcgaaact gggtcttttg 480
ttacccaaaa gaagcaggac tggaaggaaa cagaccggga ttggtgtgta tttctgtcag 540
ggcacactgg acggtcatcc tagtgtggtt ccgctcaccg cttacctggc tggtgttcct 600
ggtccatccc ctagcaaact cgagccggat caccctattc tggccggttt tgctatttcc 660
cgcctcgaaa tccccttgaa gtacacagcc tgaaatttgg ctttttcttc actgtcgtgc 720
aagacgcaaa acgccttact ttgaacaaca tcaacatcta gcaaatgctg acgaaatttg 780
agaaacacaa gagctttacc aacctctaaa aaataaccta ggctcccgtt tgcagctccg 840
catctctttc agcaccatta tagaactccg gaaagcatat tcacagcacg tgagacgcgg 900
attggctaaa taatcagtgc tgatttggac atgttgaaag gcgttttgaa acaccctctg 960
gtacaccaga tacgaaggaa acccgtgaag gtgttggttc ccgtcttcgg attggctgtt 1020
ttgttgtttc tggtgtttgg aggctcgtct tccaacagaa agaccaacag tccctactcg 1080
tacaagcgca acaacagaga tgaggtgatt ccacgtaatt tgccagcgga tcacatctcc 1140
cactatgacc tgaacaacct tgcgtcgacg ccgatggctg cttacaacaa ggagagagtg 1200
ttgattttga cgccaatggc gaagtttctg gacggatact gggacaactt gctgaaattg 1260
acatatccac gtgacctgat cgagctcgga ttcattgtgc cgcgcacagc agagggagac 1320
caagcattga agaagctgga gcacgcggtg aagattatcc agaacccaaa gaacaccaag 1380
gaacctaagt tcgccaaagt cacgatcctc agacaggaca acgagtccct ttcgtcacag 1440
tcggaaaagg acagacacgc gttcaaggtg cagaaagaac ggcgcgcaca aatggccaca 1500
gccagaaact cgctgctgtt caccaccatt ggcccgtaca cctcatgggt tctgtggctt 1560
gactcagata tcgtggagtc gcctcacacg ttgatccagg atcttgtttc gcacgacaag 1620
ccagtcattg ctgccaattg ctaccagaga tactacgacg aggacaagaa ggaggactcc 1680
atccgtcctt acgacttcaa caactggatc gagtctgaag agggactacg gatcgcatcc 1740
acgatgtcgg acgacgagat catcgtggaa gcgtacgcag aaattgccac ctatcgtcca 1800
ctgatgggcc atttctatga tcctaacggc gacctgggaa ccgagatgca actggatggt 1860
gtcggaggaa cctgtctgat ggtgaaggcc gacgtccatc gcgacggggc catgttcccg 1920
aacttcccct tctaccatct catcgaaacc gaagggttcg ccaaaatggc caaacggctt 1980
ggctaccagg tgtttggtct tccaaactat cttgttttcc actacaacga gtgactcttg 2040
gtcttttata tagttgagca aaaatgaaaa aacatgtcaa aaatagcaag acaacgtgaa 2100
atgtgtcgcg acgcgacgcc gtagttgttg caccgcaacg cgaacttctg tcgcgcctgt 2160
caactagaat aggttcgcac acgaccccac cgttccgatt tccttatcag caaagagatc 2220
. * 2221 

<210> 70 
<211> 367 
<212> PRT 

<213> Ogataea minuta 
<400> 70 

Met Leu Lys Gly Val Leu Lys His Pro Leu Val His Gln Ile Arg Arg
1 5 10 15

Lys Pro Val Lys Val Leu Val Pro Val Phe Gly Leu Ala Val Leu Leu
20 25 30

Phe Leu Val Phe Gly Gly Ser Ser Ser Asn Arg Lys Thr Asn Ser Pro
35 40 45

Tyr Ser Tyr Lys Arg Asn Asn Arg Asp Glu Val Ile Pro Arg Asn Leu
50 55 60

Pro Ala Asp His Ile Ser His Tyr Asp Leu Asn Asn Leu Ala Ser Thr
65 70 75 80

Pro Met Ala Ala Tyr Asn Lys Glu Arg Val Leu Ile Leu Thr Pro Met
85 90 95

Ala Lys Phe Leu Asp Gly Tyr Trp Asp Asn Leu Leu Lys Leu Thr Tyr
100 105 110

Pro Arg Asp Leu Ile Glu Leu Gly Phe Ile Val Pro Arg Thr Ala Glu
115 120 125

Gly Asp Gln Ala Leu Lys Lys Leu Glu His Ala Val Lys Ile Ile Gln
130 135 140

Asn Pro Lys Asn Thr Lys Glu Pro Lys Phe Ala Lys Val Thr Ile Leu
145 150 155 160

Arg Gln Asp Asn Glu Ser Leu Ser Ser Gln Ser Glu Lys Asp Arg His
165 170 175

Ala Phe Lys Val Gln Lys Glu Arg Arg Ala Gln Met Ala Thr Ala Arg
180 185 190

Asn Ser Leu Leu Phe Thr Thr Ile Gly Pro Tyr Thr Ser Trp Val Leu
195 200 205

Trp Leu Asp Ser Asp Ile Val Glu Ser Pro His Thr Leu Ile Gln Asp
210 215 220

Leu Val Ser His Asp Lys Pro Val Ile Ala Ala Asn Cys Tyr Gln Arg
225 230 235 240

Tyr Tyr Asp Glu Asp Lys Lys Glu Asp Ser Ile Arg Pro Tyr Asp Phe
245 250 255

Asn Asn Trp Ile Glu Ser Glu Glu Gly Leu Arg Ile Ala Ser Thr Met
260 265 270

Ser Asp Asp Glu Ile Ile Val Glu Ala Tyr Ala Glu Ile Ala Thr Tyr
275 280 285

Arg Pro Leu Met Gly His Phe Tyr Asp Pro Asn Gly Asp Leu Gly Thr
290 295 300

Glu Met Gln Leu Asp Gly Val Gly Gly Thr Cys Leu Met Val Lys Ala
305 310 315 320

Asp Val His Arg Asp Gly Ala Met Phe Pro Asn Phe Pro Phe Tyr His
325 330 335

Leu Ile Glu Thr Glu Gly Phe Ala Lys Met Ala Lys Arg Leu Gly Tyr
340 345 350

Gln Val Phe Gly Leu Pro Asn Tyr Leu Val Phe His Tyr Asn Glu
355 360 365

<210> 71
<211> 30
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer DMN5

<400> 71

agatgaggtg attccacgta atttgccagc 30



<210> 72 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer DMN3 

<400> 72 

ttttgattgt catctatttc gcacaccctg 30 

<210> 73 

<211> 12 

<212> PRT 

<213> Pichia pastoris 

<400> 73 

Gly Gly Gly Ser Ser Ile Asn Phe Met Met Tyr Thr
1 5 10

<210> 74 

<211> 10 

<212> PRT 

<213> Pichia pastoris 
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<400> 74 



Asp Met Trp Pro Met Val Trp Ala Tyr Lys
1 5 10



<210> 75
<211> 35
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer PAX5 for amplification of
5'-region of Ogataea minuta AOX1 gene

<400> 75

ggnggnggnw snwsnathaa yttyatgatg tayac 35

<210> 76
<211> 29
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer PAX3 for amplification of
3'-region of Ogataea minuta AOX1 gene

<400> 76

ttrtangccc anaccatngg ccacatrtc 29

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<210> 77 
<211> 5817 
<212> DNA 

<213> Ogataea minuta 

<220> 
<221> CDS 



<222> 2349..4340




<400> 77

aagctttctt tcgcaaacag ctctttggta gaggagaata gagtgcccag ctgataaaga 60
aggcgcactt taaaagataa tctacatcca gaaaaataaa aaaataaaac tgaaccggca 120
tttgcgatta cgtaagccac aaaatttcag gaaactcgta caagatcagg ttggcgaggg 180
ggctagcgat agaatgtatc agtgttatta gtggctctag gagtagaaaa caatagaata 240
aagatccgaa gaaagggagc aagaaggcca cgccagacgt tctagtaggt agcccaatcg 300
tcaatgtagc tgttcaggtc tttcaacagg ttcttggtct cgtctggact ggagatccaa 360
caagtcgttg ctgcggttcg actggcatag tcgttggcgc cgagggagct gaactggtcg 420
ccgacgtgca gggtgttttc gggcttgatg gttcggttgt cgttgcagct gaggaactct 480
tggaggattt tcaccccgta ggacttgtcg ccaatatcga cccagatgtc cgagcctccg 540
ttgaaagcgc accatctgat tttcttagag gaagggaagt ggcggagacg tttgtctacg 600
cgcagaacca cctcttccag ctgctcgcga acgagtttgt agccttcttt ggggaccagt 660
ccaacggcac gctcctttct gatgatgatg gcatgcaatg aaagtttctt gatcaggtcc 720
gagaagatct cctgcgcaaa gtccagggtc cggatgatgt cttcctcgga ccagtcgagc 780
atgttttcaa gcagccattt gtctttggag aagaactcga gtccgccaag ctcgttggag 840
tagcggaata ggtagtttgc ttcgcctccc atcaccagaa cgttctggcg ctgtctgtcg 900
gtgagctttg gggtggtctt cacctcgtct atgagcccct tgagtcgggc gtagtacttg 960
gagccgtcgg agtagcccgc ggcagtgacg atgccgacat agaggtcttt ggcgagcagc 1020
ttgatgagat ggggcaagat cggcgacgag gcgtcgaagt tggagccgtc atcgtagaga 1080
gtgatgtctc cgtcaaaagt cacgagctgg agtctgcggt gtacggatgt tttgttgtgg 1140
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aaagtgttgg agagctcgag aagttgcgcc gtgttcagaa tgagccgaat gtcgttgaac 1200 
gagggcgcta caagtctcct tttgctgatt gtgcggcgtc cgtcctcgat gtagaacgcc 1260 
ttctccaggg gcaatcgggt gaagaaacag ccaacggaag gcaccaattg gaccaatctg 1320 
gacatttcag gcattcccgc ctgggtcatc tcgatgttgt cgttgatcag cagctcgagg 1380 
tcatggaaga tttccgcgta gcgtcgcttc gcttccgaat tcaccatgag gtcgtccact 1440 
gcggagatcc cattggactt gactgcatag agaacaaacg gggtggccag caagcccttg 1500 
atccactcaa tcagtccgtc tcggcggtgc tccttgagcg cgtactcgac tctgtatctg 1560 
gttgtcattt gcgggagggg tgtaaagcag ctcagccggt gactgtgcaa ggacgaacgg 1620 
ttcctacttg aatgctaggc tggctaattg ggtatggcac aaacggcaca aacggcagat 1680 
gactgcaaat gacgacggta aacagaatcc actcagctgg cactaactgg gtgtagacta 1740 
agagttcgag ccggggaggg agtgacgatg cagccagaaa aagagccggt acgcaatcag 1800
ggaaatagcc gtcaaaagaa aaacagaagg ggctgcagtt ttgctgccgc ccgccgcgcg 1860 
cccgcgctgg ctttccccgg ccggggaggc agccggctaa agaaaatagc ctatttcgat 1920
ttcgcgtagc ccctcggttg cctattgagg gttacttttc gctccctctt ttgggccaac 1980
tgacagtttg tggggtaaca acggtgtccg aggccagcta ttcggcaaac aatagacaga 2040 
ttagagacct actacggagt ttcagtgtct tcggaagctg cacagcccga atgtcggagc 2100 
ccgtgtgacg acacccccgc atggcttttg gcaatctcac atcgcccctc cctgcgtctc 2160 
cactctgggc atgagcagtg gtgtgcctgg tgtatctctg gcccccgcgg ggcagacagc 2220
aaactgcgta taaatagcta cttccatctc ctacttgttg caccattgcc atagtaagaa 2280
aagaagcaga tcactcaact tgttcaaaga ctcttgtgtt ctgttacgac ttacgactta 2340
cgaaaaaaat ggctattcct gacgaattcg atatcatcgt tgtgggtgga ggctcatgcg 2400 
gctgcgccat cgccggtaga ctcggtaacc tcgacccgga cgttactgtg gctctcatcg 2460 
agggtggtga gaacaacatc aataacccat gggtctacct tcctggtgtc tatccaagaa 2520 
acatgagact cgactccaag acggctacct tctacaactc gagaccatcc aagcacctga 2580 
acggcagaag ggccattgtc ccctgcgcta acattcttgg tggaggttcc tccatcaact 2640 
tcctcatgta caccagagcc tcggcctccg actacgacga ctgggagcaa gagggatgga 2700 
ccaccgacga gctgcttccg ctcatgaaga agctcgagac gtatcaacgt ccttgcaaca 2760 
acagggaggt gcacggtttc gacggtccga tcaaggtctc cttcggtaac tacacctacc 2820 
caactgccca agacttcctg agagcctgcg agtcgcaggg tattcctttc aacgacgatc 2880
ttgaagacct caaggcctcg cacggagctg agtactggct caagtggatc aacagggatc 2940
tcggtagaag atcggactcg gcacacgcct acatccaccc taccatgaga aacaagagca 3000
atctgttcct cattacgtcc accaaggctg acaaggtgat cattgagaac ggcgttgctg 3060
tcggtgtcag gaccgttcca atgaagccgg tcgagaccaa aaaccctcca agcaggatct 3120
tcaaggccag aaagcaaatt gtggtttcgt gcggtacgat ctcctctcca ttggtgctgc 3180
aaagatctgg tatcggtgcg gcccacaagc tgagacaagc gggcatcaag ccgatcgtcg 3240
acttgcctgg tgtcggtgag aacttccagg accactactg cttcttcacc ccatactatt 3300
ccaagccaga ggttccaacc tttgacgact ttgtcagagg tgacccagtc gctcaaaagt 3360
ccgcctttga ccagtggtac tccaacaagg acggtcctct taccaccaac ggtatcgagg 3420
ctggtgtcaa gatcagacca accgacgagg agttggccac ggctgacgat gacttcatcc 3480
aagggtacca cgagtacttt gacaacaagc cagacaagcc actgatgcat tactctgtca 3540
tttccggttt cttcggtgac cacaccaaga ttccaaacgg caagttcttc accatgttcc 3600
actttttgga gtacccattt tcgagaggtt tcgtttatgc tgtttcccca gacccatacg 3660
aagctccaga ctttgatcca ggtttcctga acgattccag agacatgtgg cctatggttt 3720
ggtcttacaa gaagtcgaga cagacagcca gaagaatgga gtcgtttgct ggtgaagtca 3780
cctcgcacca cccactctac ccggttgact ctccagcccg tgccaaggac ttggatctcg 3840
agacatgcaa ggcatttgct ggaccaaacc acttcaccgc caacttgtac cacggttcct 3900
ggactgttcc aattgagaag ccaacgccaa agaacgactc gcacgtgacc tgcaaccagg 3960
tcgagatctt ctccgacatt gactactctg ccgaggacga tgaggctatt gtcaagtaca 4020
tcaaggagca cactgagacc acctggcact gtttgggaac ctgttcgatg gctccacaag 4080
aaggtagcaa gatcgctcca aagggtggtg ttgtcgatgc cagattgaac gtgtacgaag 4140
tgaagaacct caaggttgcc gacctgtcga tctgcccaga taacgttgga tgtaatactt 4200
actccactgc tcttctgatt ggtgagaagg ctgccacttt ggtcgccgag gacctgggat 4260
actcaggatc tgatctcgcc atgaccattc caaacttcaa gctaggtact tacgaggaga 4320
agggtctggc tagattctaa gagacggtgc ccgactcttg ttcaattctt ttggttcttt 4380
ttctgttttt ctctacgatt ctacttgatg atgtatgacg agtgaagatt gtgttttttt 4440
ctctctatag ttttgactgt aatgaaaata gtctacatga atgaaagaga tagctgacca 4500
atacggggcg tctggtcacg tgatgtatca cgtgatcttt aagttttcga aatgactaaa 4560
tttataacga aaaaaagagt ctaaatgaaa aaaaatcgat ctctgccaaa gactcatcga 4620
taggctaact caggaagcat tccgagcaac gcataatgcc ctcaaccaca gtctcagaga 4680
tgcgcaaaaa ggtgctgatg atcgacaatt acgactcgtt cacatggaac ttgtacgagt 4740 
atctttgtca agagggagcc gatgtcgagg tctatcgtaa cgacaagatc acaattgaag 4800 
aaatcgagga aatgaagcct gacattatag tgatttcgcc aggccccgga catccgagat 4860 
cggactctgg tatctctcga aagactattg agattttcaa gggccggatt cctgtttttg 4920 
gagtgtgcat gggccaacag tgcatttacg aggttttcgg gggagacgtt gagtacgctg 4980 
gtgaaattgt tcacggaaaa acctcttctg tgacccacga caatcgtgga gtcttcaaga 5040 
acgttccgca gggagttgct gtgacgagat accattcgtt ggctggaacg ctaaaaactt 5100
tgcccagcga gttggaggtg actgcccgta ccactaacgg tatcattatg ggtgtcaggc 5160 
ataaaagata cactattgaa ggcgttcagt ttcacccgga gtccattttg acagaagagg 5220 
ggcacttaat gatcaagaac attttgaaga gtagcggtgg ttactggaac gaggaggagg 5280 
aggaggtaaa acaaggcggt gccaagaagg agtcgatttt agacaagatt tacggcgaga 5340 
gaaaaaaggc gtacgaagag attgaaaaac agccaggtcg ctcgtttgcc gatttggagg 5400
cctatttgga gctatgtggt gccccagacg ttttgaactt ctacgaccgg ttaaatgaga 5460 
acgtcaagca aggaaagcct gccattttga gtgaaatcaa gagagcctcg ccttcgaaag 5520 
gggctattca gatgggtgcc aatgctgcaa aacaggcgta cacctatgcc acggcggggg 5580
tttcggctat atccgttttg acagagccaa actggttcaa aggaacgatt gaggatttac 5640 
gagttgcgcg tcagacggtt ggaaaactcc aacaccgtcc gtgcattttg cgcaaggagt 5700 
ttgtcttttg caagtaccag attctggagg ccagactggc gggcgcagac actgttttgc 5760 
tgattgtcaa gatgctttca ggagctgagt tgcgcgaact ggttggctat tcccggg 5817 

<210> 78 
<211> 663 
<212> PRT 

<213> Ogataea minuta 
<400> 78 

Met Ala Ile Pro Asp Glu Phe Asp Ile Ile Val Val Gly Gly Gly Ser
1 5 10 15

Cys Gly Cys Ala Ile Ala Gly Arg Leu Gly Asn Leu Asp Pro Asp Val
20 25 30

Thr Val Ala Leu Ile Glu Gly Gly Glu Asn Asn Ile Asn Asn Pro Trp
35 40 45

Val Tyr Leu Pro Gly Val Tyr Pro Arg Asn Met Arg Leu Asp Ser Lys
50 55 60

Thr Ala Thr Phe Tyr Asn Ser Arg Pro Ser Lys His Leu Asn Gly Arg
65 70 75 80

Arg Ala Ile Val Pro Cys Ala Asn Ile Leu Gly Gly Gly Ser Ser Ile
85 90 95

Asn Phe Leu Met Tyr Thr Arg Ala Ser Ala Ser Asp Tyr Asp Asp Trp
100 105 110

Glu Gln Glu Gly Trp Thr Thr Asp Glu Leu Leu Pro Leu Met Lys Lys
115 120 125

Leu Glu Thr Tyr Gln Arg Pro Cys Asn Asn Arg Glu Val His Gly Phe
130 135 140

Asp Gly Pro Ile Lys Val Ser Phe Gly Asn Tyr Thr Tyr Pro Thr Ala
145 150 155 160

Gln Asp Phe Leu Arg Ala Cys Glu Ser Gln Gly Ile Pro Phe Asn Asp
165 170 175

Asp Leu Glu Asp Leu Lys Ala Ser His Gly Ala Glu Tyr Trp Leu Lys
180 185 190

Trp Ile Asn Arg Asp Leu Gly Arg Arg Ser Asp Ser Ala His Ala Tyr
195 200 205

Ile His Pro Thr Met Arg Asn Lys Ser Asn Leu Phe Leu Ile Thr Ser
210 215 220

Thr Lys Ala Asp Lys Val Ile Ile Glu Asn Gly Val Ala Val Gly Val
225 230 235 240

Arg Thr Val Pro Met Lys Pro Val Glu Thr Lys Asn Pro Pro Ser Arg
245 250 255

Ile Phe Lys Ala Arg Lys Gln Ile Val Val Ser Cys Gly Thr Ile Ser
260 265 270

Ser Pro Leu Val Leu Gln Arg Ser Gly Ile Gly Ala Ala His Lys Leu
275 280 285

Arg Gln Ala Gly Ile Lys Pro Ile Val Asp Leu Pro Gly Val Gly Glu
290 295 300

Asn Phe Gln Asp His Tyr Cys Phe Phe Thr Pro Tyr Tyr Ser Lys Pro
305 310 315 320

Glu Val Pro Thr Phe Asp Asp Phe Val Arg Gly Asp Pro Val Ala Gln
325 330 335

Lys Ser Ala Phe Asp Gln Trp Tyr Ser Asn Lys Asp Gly Pro Leu Thr
340 345 350

Thr Asn Gly Ile Glu Ala Gly Val Lys Ile Arg Pro Thr Asp Glu Glu
355 360 365

Leu Ala Thr Ala Asp Asp Asp Phe Ile Gln Gly Tyr His Glu Tyr Phe
370 375 380

Asp Asn Lys Pro Asp Lys Pro Leu Met His Tyr Ser Val Ile Ser Gly
385 390 395 400

Phe Phe Gly Asp His Thr Lys Ile Pro Asn Gly Lys Phe Phe Thr Met
405 410 415

Phe His Phe Leu Glu Tyr Pro Phe Ser Arg Gly Phe Val Tyr Ala Val
420 425 430

Ser Pro Asp Pro Tyr Glu Ala Pro Asp Phe Asp Pro Gly Phe Leu Asn
435 440 445

Asp Ser Arg Asp Met Trp Pro Met Val Trp Ser Tyr Lys Lys Ser Arg
450 455 460

Gln Thr Ala Arg Arg Met Glu Ser Phe Ala Gly Glu Val Thr Ser His
465 470 475 480

His Pro Leu Tyr Pro Val Asp Ser Pro Ala Arg Ala Lys Asp Leu Asp
485 490 495

Leu Glu Thr Cys Lys Ala Phe Ala Gly Pro Asn His Phe Thr Ala Asn
500 505 510

Leu Tyr His Gly Ser Trp Thr Val Pro Ile Glu Lys Pro Thr Pro Lys
515 520 525

Asn Asp Ser His Val Thr Cys Asn Gln Val Glu Ile Phe Ser Asp Ile
530 535 540

Asp Tyr Ser Ala Glu Asp Asp Glu Ala Ile Val Lys Tyr Ile Lys Glu
545 550 555 560

His Thr Glu Thr Thr Trp His Cys Leu Gly Thr Cys Ser Met Ala Pro
565 570 575

Gln Glu Gly Ser Lys Ile Ala Pro Lys Gly Gly Val Val Asp Ala Arg
580 585 590

Leu Asn Val Tyr Glu Val Lys Asn Leu Lys Val Ala Asp Leu Ser Ile
595 600 605

Cys Pro Asp Asn Val Gly Cys Asn Thr Tyr Ser Thr Ala Leu Leu Ile
610 615 620

Gly Glu Lys Ala Ala Thr Leu Val Ala Glu Asp Leu Gly Tyr Ser Gly
625 630 635 640

Ser Asp Leu Ala Met Thr Ile Pro Asn Phe Lys Leu Gly Thr Tyr Glu
645 650 655

Glu Lys Gly Leu Ala Arg Phe
660

<210> 79
<211> 2348
<212> DNA

<213> Ogataea minuta
<400> 79

aagctttctt tcgcaaacag ctctttggta gaggagaata gagtgcccag ctgataaaga 60
aggcgcactt taaaagataa tctacatcca gaaaaataaa aaaataaaac tgaaccggca 120
tttgcgatta cgtaagccac aaaatttcag gaaactcgta caagatcagg ttggcgaggg 180
ggctagcgat agaatgtatc agtgttatta gtggctctag gagtagaaaa caatagaata 240
aagatccgaa gaaagggagc aagaaggcca cgccagacgt tctagtaggt agcccaatcg 300
tcaatgtagc tgttcaggtc tttcaacagg ttcttggtct cgtctggact ggagatccaa 360
caagtcgttg ctgcggttcg actggcatag tcgttggcgc cgagggagct gaactggtcg 420
ccgacgtgca gggtgttttc gggcttgatg gttcggttgt cgttgcagct gaggaactct 480
tggaggattt tcaccccgta ggacttgtcg ccaatatcga cccagatgtc cgagcctccg 540
ttgaaagcgc accatctgat tttcttagag gaagggaagt ggcggagacg tttgtctacg 600
cgcagaacca cctcttccag ctgctcgcga acgagtttgt agccttcttt ggggaccagt 660
ccaacggcac gctcctttct gatgatgatg gcatgcaatg aaagtttctt gatcaggtcc 720
gagaagatct cctgcgcaaa gtccagggtc cggatgatgt cttcctcgga ccagtcgagc 780
atgttttcaa gcagccattt gtctttggag aagaactcga gtccgccaag ctcgttggag 840
tagcggaata ggtagtttgc ttcgcctccc atcaccagaa cgttctggcg ctgtctgtcg 900
gtgagctttg gggtggtctt cacctcgtct atgagcccct tgagtcgggc gtagtacttg 960
gagccgtcgg agtagcccgc ggcagtgacg atgccgacat agaggtcttt ggcgagcagc 1020
ttgatgagat ggggcaagat cggcgacgag gcgtcgaagt tggagccgtc atcgtagaga 1080
gtgatgtctc cgtcaaaagt cacgagctgg agtctgcggt gtacggatgt tttgttgtgg 1140 
aaagtgttgg agagctcgag aagttgcgcc gtgttcagaa tgagccgaat gtcgttgaac 1200 
gagggcgcta caagtctcct tttgctgatt gtgcggcgtc cgtcctcgat gtagaacgcc 1260 
ttctccaggg gcaatcgggt gaagaaacag ccaacggaag gcaccaattg gaccaatctg 1320 
gacatttcag gcattcccgc ctgggtcatc tcgatgttgt cgttgatcag cagctcgagg 1380 
tcatggaaga tttccgcgta gcgtcgcttc gcttccgaat tcaccatgag gtcgtccact 1440 
gcggagatcc cattggactt gactgcatag agaacaaacg gggtggccag caagcccttg 1500 
atccactcaa tcagtccgtc tcggcggtgc tccttgagcg cgtactcgac tctgtatctg 1560 
gttgtcattt gcgggagggg tgtaaagcag ctcagccggt gactgtgcaa ggacgaacgg 1620 
ttcctacttg aatgctaggc tggctaattg ggtatggcac aaacggcaca aacggcagat 1680 
gactgcaaat gacgacggta aacagaatcc actcagctgg cactaactgg gtgtagacta 1740 
agagttcgag ccggggaggg agtgacgatg cagccagaaa aagagccggt acgcaatcag 1800 
ggaaatagcc gtcaaaagaa aaacagaagg ggctgcagtt ttgctgccgc ccgccgcgcg 1860 
cccgcgctgg ctttccccgg ccggggaggc agccggctaa agaaaatagc ctatttcgat 1920
ttcgcgtagc ccctcggttg cctattgagg gttacttttc gctccctctt ttgggccaac 1980
tgacagtttg tggggtaaca acggtgtccg aggccagcta ttcggcaaac aatagacaga 2040
ttagagacct actacggagt ttcagtgtct tcggaagctg cacagcccga atgtcggagc 2100
ccgtgtgacg acacccccgc atggcttttg gcaatctcac atcgcccctc cctgcgtctc 2160
cactctgggc atgagcagtg gtgtgcctgg tgtatctctg gcccccgcgg ggcagacagc 2220
aaactgcgta taaatagcta cttccatctc ctacttgttg caccattgcc atagtaagaa 2280 
aagaagcaga tcactcaact tgttcaaaga ctcttgtgtt ctgttacgac ttacgactta 2340 
cgaaaaaa 2348 



<210> 80
<211> 802
<212> DNA

<213> Ogataea minuta
<400> 80 

gagacggtgc ccgactcttg ttcaattctt ttggttcttt ttctgttttt ctctacgatt 60 

ctacttgatg atgtatgacg agtgaagatt gtgttttttt ctctctatag ttttgactgt 120 

aatgaaaata gtctacatga atgaaagaga tagctgacca atacggggcg tctggtcacg 180 

tgatgtatca cgtgatcttt aagttttcga aatgactaaa tttataacga aaaaaagagt 240 

ctaaatgaaa aaaaatcgat ctctgccaaa gactcatcga taggctaact caggaagcat 300 

tccgagcaac gcataatgcc ctcaaccaca gtctcagaga tgcgcaaaaa ggtgctgatg 360

atcgacaatt acgactcgtt cacatggaac ttgtacgagt atctttgtca agagggagcc 420 

gatgtcgagg tctatcgtaa cgacaagatc acaattgaag aaatcgagga aatgaagcct 480 

gacattatag tgatttcgcc aggccccgga catccgagat cggactctgg tatctctcga 540 

aagactattg agattttcaa gggccggatt cctgtttttg gagtgtgcat gggccaacag 600 

tgcatttacg aggttttcgg gggagacgtt gagtacgctg gtgaaattgt tcacggaaaa 660 

acctcttctg tgacccacga caatcgtgga gtcttcaaga acgttccgca gggagttgct 720 

gtgacgagat accattcgtt ggctggaacg ctaaaaactt tgcccagcga gttggaggtg 780 
actgcccgta ccactaacgg ta 802 

<210> 81 
<211> 30 

<212> DNA
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer OAP5 for production of
expression cassette with AOX1 gene promoter and terminator



<400> 81 

ctgcagcccc ttctgttttt cttttgacgg 30

<210> 82
<211> 90 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer OAP3 for production of an
expression cassette with AOX1 gene promoter and terminator

<400> 82

cccccggatc caggaacccg ggaacagaat ctagattttt tcgtaagtcg taagtcgtaa 60
cagaacacaa gagtctttga acaagttgag 90

<210> 83 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer OAT5 for production of an
expression cassette with AOX1 gene promoter and terminator



<400> 83

cccccccgga tccgagacgg tgcccgactc ttgttcaatt cttttgg 47

<210> 84
<211> 33
<212> DNA

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: primer OAT3 for production of an
expression cassette with AOX1 gene promoter and terminator

<400> 84

cccataatgg taccgttagt ggtacgggca gtc 33

<210> 85 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer HGP5 for amplification of 
a gene conferring resistance against hygromycin B 

<400> 85 

gtcgacatga aaaagcctga actcaccgc 29 

<210> 86 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer HGP3 for amplification of 
a gene conferring resistance against hygromycin B 

<400> 86

actagtctat tcctttgccc tcggacg 27



<210> 87 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer for amplification of
5'-region of α-mannosidase gene

<400> 87

ggggggtcga catggtggtc ttcagcaaaa ccgctgccc 39

<210> 88
<211> 43
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer for amplification of
5'-region of α-mannosidase gene

<400> 88

ggggggcggc cgcgtgatgt tgaggttgtt gtacggaacc ccc 43



<210> 89
<211> 40
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer for amplification of 
Saccharomyces cerevisiae SUC2 gene 

<400> 89

ggggactagt atgcttttgc aagctttcct tttccttttg 40 

<210> 90 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer for amplification of 
Saccharomyces cerevisiae SUC2 gene 

<400> 90 

ccccagatct tattttactt cccttacttg gaacttgtc 39 

<210> 91
<211> 711
<212> DNA

<213> Homo sapiens

<220>
<221> CDS
<222> 7..711

<400> 91 

ctcaccatga gggtccccgc tcagctcctg gggctcctgc tgctctggct cccaggtgca 60 

cgatgtgaca tccagatgac ccagtctcca tcttccgtgt ctgcatctgt aggagacaga 120 

gtcaccatca cttgtcgggc gagtcaggtt attagcagct ggttagcctg gtatcagcag 180 

aaaccaggga aagcccctaa gctcctgatc tatgctgcat ccagtttgca aagtggggtc 240 

ccatcaaggt tcagcggcag tggatctggg acagatttca ctctcaccat cagcagcctg 300 

cagcctgaag attttgcaac ttactattgt caacaggcta acagtttccc tccgacgttc 360 

ggccaaggga ccaaggtgga aatcaaacgt acggtggctg caccatctgt cttcatcttc 420 

ccgccatctg atgagcagtt gaaatctgga actgcctctg ttgtgtgcct gctgaataac 480 

ttctatccca gagaggccaa agtacagtgg aaggtggata acgccctcca atcgggtaac 540 

tcccaggaga gtgtcacaga gcaggacagc aaggacagca cctacagcct cagcagcacc 600 

ctgacgctga gcaaagcaga ctacgagaaa cacaaagtct acgcctgcga agtcacccat 660 

cagggcctga gctcgcccgt cacaaagagc ttcaacaggg gagagtgttg a 711 

<210> 92

<211> 234 

<212> PRT 

<213> Homo sapiens 

<400> 92 

Met Arg Val Pro Ala Gln Leu Leu Gly Leu Leu Leu Leu Trp Leu Pro
1 5 10 15

Gly Ala Arg Cys Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Val Ser
20 25 30

Ala Ser Val Gly Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Val
35 40 45

Ile Ser Ser Trp Leu Ala Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro
50 55 60

Lys Leu Leu Ile Tyr Ala Ala Ser Ser Leu Gln Ser Gly Val Pro Ser
65 70 75 80

Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser
85 90 95

Ser Leu Gln Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gln Gln Ala Asn
100 105 110

Ser Phe Pro Pro Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg
115 120 125

Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln
130 135 140

Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr
145 150 155 160

Pro Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser
165 170 175

Gly Asn Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr
180 185 190

Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys
195 200 205

His Lys Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro
210 215 220

Val Thr Lys Ser Phe Asn Arg Gly Glu Cys
225 230

<210> 93 
<211> 1428 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 1..1428

<400> 93 

atgggttgga gcctcatctt gctcttcctt gtcgctgttg ctacgcgtgt ccagtctgag 60 
gtgcagctgg tggagtctgg gggaggcctg gtcaagcctg gggggtccct gagactctcc 120 
tgtgcagcct ctggattcac cttcagtagc tatagcatga actgggtccg ccaggctcca 180 
gggaaggggc tggagtgggt ctcatccatt agtagtagta gtagttacat atactacgca 240 
gactcagtga agggccgatt caccatctcc agagacaacg ccaagaactc actgtatctg 300 
caaatgaaca gcctgagagc cgaggacacg gctgtgtatt actgtgcgag agatcggatt 360 
attatggttc ggggagtcta ctactactac ggtatggacg tctggggcca agggaccacg 420 
gtcaccgtct cctcagctag caccaagggc ccatcggtct tccccctggc accctcctcc 480 
aagagcacct ctgggggcac agcggccctg ggctgcctgg tcaaggacta cttccccgaa 540 
ccggtgacgg tgtcgtggaa ctcaggcgcc ctgaccagcg gcgtgcacac cttcccggct 600
gtcctacagt cctcaggact ctactccctc agcagcgtgg tgaccgtgcc ctccagcagc 660 
ttgggcaccc agacctacat ctgcaacgtg aatcacaagc ccagcaacac caaggtggac 720 
aagaaagttg agcccaaatc ttgtgacaaa actcacacat gcccaccgtg cccagcacct 780 
gaactcctgg ggggaccgtc agtcttcctc ttccccccaa aacccaagga caccctcatg 840 
atctcccgga cccctgaggt cacatgcgtg gtggtggacg tgagccacga agaccctgag 900 
gtcaagttca actggtacgt ggacggcgtg gaggtgcata atgccaagac aaagccgcgg 960 
gaggagcagt acaacagcac gtaccgtgtg gtcagcgtcc tcaccgtcct gcaccaggac 1020 
tggctgaatg gcaaggagta caagtgcaag gtctccaaca aagccctccc agcccccatc 1080
gagaaaacca tctccaaagc caaagggcag ccccgagaac cacaggtgta caccctgccc 1140
ccatcccggg atgagctgac caagaaccag gtcagcctga cctgcctggt caaaggcttc 1200
tatcccagcg acatcgccgt ggagtgggag agcaatgggc agccggagaa caactacaag 1260
accacgcctc ccgtgctgga ctccgacggc tccttcttcc tctacagcaa gctcaccgtg 1320
gacaagagca ggtggcagca ggggaacgtc ttctcatgct ccgtgatgca tgaggctctg 1380
cacaaccact acacgcagaa gagcctctcc ctgtctccgg gtaaatga 1428

<210> 94 

<211> 475

<212> PRT 

<213> Homo sapiens 
<400> 94 

Met Gly Trp Ser Leu Ile Leu Leu Phe Leu Val Ala Val Ala Thr Arg
1 5 10 15

Val Gln Ser Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Lys
20 25 30

Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe
35 40 45

Ser Ser Tyr Ser Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu
50 55 60

Glu Trp Val Ser Ser Ile Ser Ser Ser Ser Ser Tyr Ile Tyr Tyr Ala
65 70 75 80

Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn
85 90 95

Ser Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val
100 105 110

Tyr Tyr Cys Ala Arg Asp Arg Ile Ile Met Val Arg Gly Val Tyr Tyr
115 120 125

Tyr Tyr Gly Met Asp Val Trp Gly Gln Gly Thr Thr Val Thr Val Ser
130 135 140

Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser Ser
145 150 155 160

Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp
165 170 175

Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr
180 185 190

Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr
195 200 205

Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln
210 215 220

Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp
225 230 235 240

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro
245 250 255

Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro
260 265 270

Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr
275 280 285

Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn
290 295 300

Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg
305 310 315 320

Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val
325 330 335

Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser
340 345 350

Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys
355 360 365

Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp
370 375 380

Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe
385 390 395 400

Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu
405 410 415

Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe
420 425 430

Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly
435 440 445

Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr
450 455 460

Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
465 470 475

<210> 95 
<211> 7 
<212> PRT 

<213> Saccharomyces cerevisiae
<400> 95

Val Gly Phe Leu Asp His Met
1 5



<210> 96 

<211> 7 

<212> PRT 

<213> Saccharomyces cerevisiae 

<400> 96 

Pro Ser Thr Lys Gly Val Leu 
1 5 

<210> 97 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer PHI5 for amplification of 
Ogataea minuta HIS3 gene 



<400> 97

tnggnttyyt ngaycayatg 20

<210> 98
<211> 20
<212> DNA

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: primer PHI3 for amplification of
Ogataea minuta HIS3 gene 

<400> 98

arnacnccyt tngtnswngg 20

<210> 99
<211> 3831
<212> DNA 

<213> Ogataea minuta 

<220> 

<221> CDS 

<222> 1839..2552

<400> 99 

ctgcagatgc gccgttgctg ctgagcaaag tgaaagagca cagagccaaa attgcctctg 60 
ttttggaaca gattgacccg aagttggagg aactgaacaa agaaaaagaa gctcacattt 120 
ccagtgaaga tatccgggac ggttggaaca gctcttttat caacaagaaa tctgagattg 180 
aggaaccagc gtccaacaca aaaagcgact ctgcttcgtc tgtcaagaag accaaggcga 240 
tagagaccat taacagtcca aaattgtcga aagaaccgac cccgtccaaa ccgttagacc 300 
aattgggcga gctggaactg ttggaagaga ccgaacgatt cgcccagatc tcgtctcaag 360 
acctgcttaa atcgatcaag tttcttgaga gacatctata catagtgagc gagcagcaga 420 
aggacgcgtt gatgatgaag tgttttgact acgagctgga cggtgactcc cagcgtgcca 480 
aacagagcgt tcaccaggcg ctgattctgc aatatttgga tgatctgttc aaagccgctg 540 
gcggcccgcg cgccagtcca caccagaagg agcaggctat tggtctgttc attgggaaac 600
tgcttgacaa aacgacgcct gcctcgcggg cttttgaggc cgattggaag aagacttatc 660
atcacattgt ttccagatgc gagattatca agcaagaaca cgaacaagag ggccaagaag 720
agcccgaggg ggttgaacag atacagctga gatccatgga ccccaactcg gagctggtga 780
tcaacctgcc ttcccagaaa accccggagt acgaggcttt caagcaacta ccggagccaa 840
tgcagaaggc gattgaaacc gaaaaattgg acgaaatcaa ccgtgttttt gcctccatgt 900
cggtggaaga cgccgagggt gttttggaac tgtttgaccg ctgcggggtg attcagatac 960
aggcattgct ggagaacgag gaggagttca accagttgaa acatgagtac gagggagaac 1020
ctcttgagca aatcgaagaa gagcgaccac aaaccgcaaa ggaagagaac gcttttatac 1080
aaacagctga ccttgttgat tgaagctcaa attacgctaa acgatatata catgtcaatt 1140
gcacttaatc catatttttg agaggaggca ttctaagaat ctctctagtt ttgttctcgc 1200
tcattgcttg cagttttttg agaatctcag agttcttcca cttctcaggc agcggggtga 1260
cgtcgtactt cttagtggcc caatagtaga tatcccagtc cggctcatca agcaatttgt 1320
cgtactcttc cagttggtcc gcagtcatcg aagccaagta cttgtccgca aatctactca 1380
agagcaagtc agattcaaga attccccgct ttcttgactg gtaaaccagt cgtcttctct 1440
ttgtttccac gtcctcattg tctcttggga ttggctcgac cctgatcttc agctctccgt 1500
cctcctcagt taacatggtg tgggtgtttg ctccctggcc aagtctggcc accgaggttg 1560
cgaatcttct tctggtcaca aatcctgtca gcttaagcat tttgtccgtt gtgagaccaa 1620
aactagcctt gtctagggag tagttaacat ttgtaatgga tcttttcgtt ttctttgctc 1680
ggaaagcagc gacagatcgg ccgaccgcga ctatcgacgc gacgataggg attttttttt 1740
ttaaaaaaaa gagtcaaaaa aacatcctga agagaaactt ttcatgggga accatgtcca 1800
ggagtcatcc gacggtatcc ctttacagtc aaaaaaacat gtctgaagag aacaagaaac 1860
gcaagctgga aaacggcaca aatgatgcaa aagctgctcg gtttgcagag gtgagaagag 1920
tcaccaatga aaccagcatc cagatcatcc tgaacttgga tggtggactc atcgagtgca 1980
aagagtcgat tcttggggcc acttacgaaa aagagagcca cgcggcacaa aatacgagtg 2040
cacaggtgat ctccatcaag acaggactcg gtttccttga ccacatgctg cacgcccttg 2100
ccaaacactc tggttggtca ttgattgtcg agtgtattgg ggacctgcac attgacgacc 2160
atcacactgc ggaagacgtc ggcattgcac tgggtgaaac gttcaagcgc gcgttgggac 2220
ctgtcaaagg tctcaaacga ttcgggcacg catacgctcc actggatgag gctctaagtc 2280
gcgcggttgt tgatctgtct aaccgtccat ttgccgtggt ggaacttggc ctgcgcagag 2340
aaaaaatcgg ggaccttagt tgcgagatga ttccccacgt tttggagagt tttgctacga 2400
gcgcgcacat cacgatgcac gtcgattgtc ttcgcgggtt caatgaccac caccgcagtg 2460 
aaagcgcgtt caaggcattg gccgttgcca tcagagatgc cacgagctac acgggccgtg 2520 
atgatgttcc aagtacgaag ggagttttga tgtaactcca cacccaggag cttctactaa 2580 
tattcagccc tctccctctt catcgtgtac agtttaacca tactagacta accaactccg 2640 
ccgtattctc acgtagccgc aaccgaaagc cctggaaagc agatagtcta accaatcgag 2700 
agattgggta ttcaaatatc gattttaggc aattgtaccc gtcttgagcc gccacccgcc 2760 
actacagcca caatttgcac cgccaccgaa aaatttcctg cgtcgcgctt tgtttcccct 2820 
ccgtttgtct tgtcctttct tgtcttaatg cgtctggttt gcttgttttc ttctttgccg 2880 
tagtctgatc ccacacgcag ccgtcaactg aaccgccaca ggaatcccgc ttcgtcgcga 2940 
atgttgaacg gtccgtcggt agactcttct tcttcgaccc cgtctgccct gtataaagag 3000
gcaggcgctc tgggagttta ttcctactac tacgagaacc agcgtgagtc cttcaacaga 3060 
gcccgtctga cctcccctat acgatccggt tctcctcaaa ctaagctcac tgccagaccg 3120 
tccaaggtgt ccgaggtttc cggcccaatc aacttaatca actcaatgtc gtctcctata 3180 
ccccatttgc cctcttctcc gacgcctcgc atgtcccgtt ctaacggcca cgccaatgga 3240
caatctccca caactccgtt caagacagca gatatgccgt cacgagtcag cacgccgtcg 3300
tcgtactcgc gacagctggg ctcgccgccc aaactcgacc tggtgaaccc tgccccggtg 3360
acggacgagg aactcatggt gacgagccgc gagctcattc aggacgcaat gcgctcaaga 3420
agcgtacaca gagagccttc gcagctggaa ctgtggatca aagtgatgtc gttgctcacg 3480 
ggaaaagata aggtagggaa gtgtcttcaa tacggaatac ggattctgat cgcatactcg 3540
gtccgggcca ggaagacgcc cttcctgtcc gacttcaagc tcacgggcgt tgacttcacg 3600
gggtccaaag agaacgttct gttgcagctg gtgcggaaac cagaactgct ggtgattctg 3660
tttctaggcc agtttgagtc caaactcgtc gggcttacca aaatcctctc catctacaga 3720 
cagatgctgc gagccggtac tgttccgttc aaggtaatca aactgttcgg cagactgaca 3780 
gactcagttc agatactcag cacaaacgat aaggcgtctt tgaagctgca g 3831 

<210> 100 
<211> 238 
<212> PRT
<213> Ogataea minuta
<400> 100

Met Ser Glu Glu Asn Lys Lys Arg Lys Leu Glu Asn Gly Thr Asn Asp
1 5 10 15

Ala Lys Ala Ala Arg Phe Ala Glu Val Arg Arg Val Thr Asn Glu Thr
20 25 30

Ser Ile Gln Ile Ile Leu Asn Leu Asp Gly Gly Leu Ile Glu Cys Lys
35 40 45

Glu Ser Ile Leu Gly Ala Thr Tyr Glu Lys Glu Ser His Ala Ala Gln
50 55 60

Asn Thr Ser Ala Gln Val Ile Ser Ile Lys Thr Gly Leu Gly Phe Leu
65 70 75 80

Asp His Met Leu His Ala Leu Ala Lys His Ser Gly Trp Ser Leu Ile
85 90 95

Val Glu Cys Ile Gly Asp Leu His Ile Asp Asp His His Thr Ala Glu
100 105 110

Asp Val Gly Ile Ala Leu Gly Glu Thr Phe Lys Arg Ala Leu Gly Pro
115 120 125

Val Lys Gly Leu Lys Arg Phe Gly His Ala Tyr Ala Pro Leu Asp Glu
130 135 140

Ala Leu Ser Arg Ala Val Val Asp Leu Ser Asn Arg Pro Phe Ala Val
145 150 155 160

Val Glu Leu Gly Leu Arg Arg Glu Lys Ile Gly Asp Leu Ser Cys Glu
165 170 175

Met Ile Pro His Val Leu Glu Ser Phe Ala Thr Ser Ala His Ile Thr
180 185 190

Met His Val Asp Cys Leu Arg Gly Phe Asn Asp His His Arg Ser Glu
195 200 205

Ser Ala Phe Lys Ala Leu Ala Val Ala Ile Arg Asp Ala Thr Ser Tyr
210 215 220

Thr Gly Arg Asp Asp Val Pro Ser Thr Lys Gly Val Leu Met
225 230 235

<210> 101 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer DHI5 
<400> 101 

ggcccaatag tagatatccc 20

<210> 102
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer DHI3 
<400> 102 

cacggcccgt gtagctcgtg g 21 

<210> 103 
<211> 8 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 103 

Ala Val Gly Gly Pro Lys Trp Gly 
1 5 

<210> 104 
<211> 7 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 104 

Ala Ala Met Met Leu Lys Leu 
1 5

<210> 105
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer PLE5 for amplification of
Ogataea minuta LEU2 gene 

<400> 105 

gcngtnggng gnccnaartg ggg 23 

<210> 106 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<220>
<223> Description of Artificial Sequence: primer PLE3 for amplification of 
Ogataea minuta LEU2 gene 



<400> 106

naryttnarc atcatngcng c 21

<210> 107
<211> 5615
<212> DNA

<213> Ogataea minuta

<220>
<221> CDS
<222> 1606..2694



<400> 107 



ggatccttcc tcgagatccg ggctgttttt caccgaccca tccgatcctg acttcaaccc 60
cgacttctcc tacgacagac tgtgggagtg tctcgtcagg gcagaacaga tgaccggtga 120
cgagaaagag ctgctcaggt cgaacgccaa ttacgttgcg aaaaccaagt ttggaaagaa 180
ggttttcggc gaacaatgga ccaagctgct cgcctttgtt gtcagcttgg agctgtacaa 240
aagacgccag cgggggaagg tggaggagct gtattgaata aggaatgagg agaatggttt 300
tggaaagagc cagtttatac atccgtacac cggatctaac aactgttttc acgaaatgca 360
cgacttttca attttttttt ttacttctaa aattttttat ctctaaaaag ctgtagatct 420
aagggtatgt gtgttgtatt tgcagcagtc cacttagcaa gaacacacac acacgaatga 480
ctgaagttgg ccagaaactg aacagcaacc ctgaggttct tctcaagaag agaaagcaag 540
ccgacagact ccgtctggag aaacaggaac aggctagaaa gagaatcgaa gacaaaaaga 600
gaaagaagca ggaaagaaga aaggccaagt tcatcagagc agagacacta gttgccaagc 660
acagaaccac tgaaagagag cagtacagag tgaaaagagt gacgcagaac gagaaaatca 720
agaccgaaac cgagtccgcc aagcaagaag cagctgggga ggatgaatcc aagcttcttt 780
tcgttgtgag agtgccaggc ccgcacggtg ccaaggttcc aggaaaagcc agaaaggttc 840
tgtcgttgct gcgtcttgaa cacacctaca cgggaacttt cataaagtcc aatgccacca 900
taagacctct tttgagactg atcaacccgt atgtggtgat tggaactcct tcgctggcca 960
ctgtgagaaa cctgatacaa aagagagcta cagtgaccgt gacaagcgaa gacggttctg 1020
ctagagaggt taacttggat gacaacaatc tcatcgaaga gaaattggga gagtgtggta 1080
tcatttgcgc agaagatctt attcacgaga tcgtctcttt gggagactac ttcaagccct 1140
ccgtcaagtt cctgaatcct ttccaactga acgctcctgt ccacggctgg ggtccgctca 1200
gtaagttaaa gagactcgag ctgagagaag agagcaagaa ccacaaggtg aacaacgctg 1260
gaaacgctcc tttgaacgag gttgacattg accagttcat cgctgagcag atctgagggt 1320
atttaagtaa gcatgttcga gtaatgacaa gatctgtcca cagtaagatt tgaaataatg 1380
gtctatcaat ctcgcgtcga tcgacgcgcg acgcctggcc gcttccctcc ggtttccggc 1440
gccgccagct cctgcgaccg gagaatattg ttttttatct tgatttttcg aggggattga 1500
gcattaattt ttcacccaca aaatagctag atttcggttt tcaggagcta cagagtcatc 1560
gtgaacagaa ttgtgacctt ttatcgcagc ttttttacat tcagaatgac cacaaagaac 1620
atagttttgc tgcctggtga ccacgttggc ccggaggttg ttgacgaggc cgtcaaagtt 1680
ctcaacgcca tttcggccgc caagccggaa atcaagttca acttcgaaca ccacttgatc 1740
gggggtgctg ctatcgacgc cactggccag ccaatcacag acgcggctct cgaggcttcc 1800
aagaaagcag atgctgtcct gctaggatct gtcggaggtc ctaaatgggg tactggtcaa 1860
gttcgtcctg agcaagggtt gctgaagatc agaaaagagc tcaacttgta cgccaacctg 1920
agaccgtgca gctttgcatc ggacgccttg ttggacctgt cgcctctgaa gccggaaatt 1980
gtcagaggta ccgacttcgt tgttgtcaga gagcttgttg gaggaatcta cttcggtgag 2040
agaaaggagg acgacggatc aggattcgct tccgacactg aggcctactc cgtgcccgaa 2100
gttcaaagaa tcaccagaat ggctgctttc atggccctgc aaagtgaccc ccctctccca 2160
gtgtattcgc tggacaaggc caacgttctg gcttcgtccc gtctgtggag aaagaccgtt 2220
gaagagacta tcaagaacga gtttcctcag ctgaagctgc aacaccatct gatcgactca 2280
gccgctatga ttttggtgaa gtccccaacc aaactgaacg gtgttgttct cacatccaac 2340
atgtttggag acatcatctc tgacgaagct tcggtgattc ccggttcgct gggcctgctg 2400
ccgtccgcat ctctggcttc tcttccagac tccaacgagg cgttcggtct gtacgagcct 2460
tgccacggtt ccgctcccga tctcgccaaa ggactggtga acccgctggc taccattctc 2520
tcggccgcca tgatgctcaa gttgtcgctc aaccttgttg aggagggccg tgccgtcgaa 2580
aaggctgtca gagccgttct ggaccaaggc atcatgactg cagacttggg cggatcgtcg 2640
tcgaccactg aggttggaga cgctgttgcc aaggaagtga ccaaattgct gggctaaagg 2700
ggtcaatttt gtcctgatcc ggcagagatt gttccatgca ctcgtcgagc ttccatgcag 2760
cgagcaacct gtccctcgtg tagcagctgt tcccctcaat atactggtgg ctcgccctat 2820
agtgagccag ctaccttctt ctataaatag cctaggcata cccgaatttc ttttgctccc 2880
cgagaacgta gcccgacgcg cgcctgaaca atagaaaaaa ttacaaacaa tagcggctcc 2940
aaaaatctat tgtcggagcg ttttttcaca gacttctatt cgaggtttgg tgatcctgtt 3000
tgtttttgtt ttgttgttat atgtcaccaa tcgtgaaatt ttcagacccg tagttcaacc 3060
ttgtcaggaa ctcaatcact ggttcaagtc tctgccaatc gcctcggcac aaactactcc 3120
cgtcctgcat ttcatcctct tctgaacggt tgcaagcact ctaggtgttt cagaattggc 3180
tgtacgacaa gcaacacaag gcaacaccag gcagcaaaac aagcaaaaca agcaaaaaaa 3240
ggcaccgtca attggaacaa cctgttgaac ggtgatagca tcacgtgtct ttgaaataac 3300
gaaacagcat cctcattcaa cacccctgac ctgtatttct ctggctccat cctgttccac 3360
gatgacagcc tcctgcggtc cgttcaagct cgcactcagt ctgctgttga tctcgccatc 3420
gacgctcgca tccttctcgt acttgggatg ctactcctcg tcttccgtcg actccctttc 3480
gctgtcggac tcatacatct accagtcctc gttgcattgc aagtcgcaat gctccggtag 3540
cgcggtggcc gctctgaccg gcggaaataa gtgctattgc ggcgactcag taccttctgg 3600
cgacgccagc agcgactcca aatgctcgac cgcctgtgac gggtacggct cggaaaactg 3660
cggtggcagc gggtactatt cggtgtatgt tgactccgac caagaaaacg actcgagctc 3720
gggatcgtcc agttcggaaa ccaccagctc cacaagctcc acaagctcca ccagctccag 3780
ctccacctcg agttctacct cgacaagttc aagctcaaca agctcaacga gctcgtccac 3840
tttgacgagc tcctcctctt cgtcatcatc aacttcgacg tcgagctcct ccgcttcgtc 3900
ggtctctgaa ataatcacca gttcggcttc ggaaacctcc tccgagacaa ccagcacccc 3960
atcaacctca tcatcgtcct cgtcgtcatc gtcatcgtca tccgcatcgt cctcttcgtc 4020
gtcctctacg acgtcgacca cgtcttcatc ctcgtcctcg atctcctcta catcagccac 4080
aacctcagcc acaacatcag ccacaccaac aacaacctct ccggctgtgc tcgtttccac 4140
ctctgtttcc cccggagcaa ccatgacgtc tctgatctac atcacccagt cgatctccgc 4200
ttccagcggg ctgccatcgc cctcttcatc ggcaagcacg ggcaacagct cggcccacga 4260
ctcaaaaaaa tcctcctcgg gctccaagct tagcggagga gctatagccg gaatagtcat 4320
cggagtcatt gtgggggtgg cagcgctcat cgcagccgtc ctcttttttc tgtggtacaa 4380
gaaaagatcc gacgaagacg aagagtccat caacgaaaaa gacgttccgg agatgctcaa 4440
ttcgccgaga aacaccgttt ctggcctcac ggccgccgca atcggagtca ataagttcgg 4500
cttcctcagc gaagacgacc gactggacca cccgggagca aaccgaagat tcagcgacgg 4560
ttcgctgcca aacgcagcgg ccggagcccc cggtgaccaa tcgcgcaagt ccaaagccgg 4620
aggtcttagg gttatcaatc cagacctcag tgacgaagaa tgaaaaattg tttcttgtct 4680
tttacggtcg agttcgtccc gttaattagc atatccctcc ttttgtatgg ttattcccca 4740
acatccttcc gaacactgtg tattttaaac cggcctgtct gcttgttagc gtctaggcta 4800
cttggtggcg ttctggagtg cttcttctta cgtctttcaa tataatcttg actacttatt 4860
caagcctcga atagtataat ttggggagat ccagtcacaa gagtacaata tacacaaaaa 4920
cgctcaaaaa caggtccaat ctccgtactt tgggggtgtc cttaacctta atgaccagcc 4980 
cactcgatcg agttcaaatc aataccctcg ttcaccatct cccacagcgg actcaattct 5040 
ctcaaaaact tgacccgctc ggtcagcgct ttcacaacgt aatcaacctc ctcctcggtg 5100 
gtaaatcttc caacaccgaa tctgatcgaa gagtgggcca aagcgtcgtc cgcacccaac 5160 
gcgtgcaaaa catatgatgg ctccaacgag gcggaagtac atgccgaccc cgaggaaaga 5220 
gcaatgtcct tcaacgccat caacagcgac tcaccctcca cgtacgcgaa cgaaacgttc 5280 
acacagccgg ggtaccggcg tgttttggaa ccgttcagtt gcgtgtgttc catggccaga 5340 
agattgttca tcaatttgtt cgacaaccgg gtgatatgct cgtggtcggc gtcgtactcg 5400 
gattgcatca atctggcagc ctctccaaat ccacaaacaa gcggaggagc cagagttccc 5460 
gacctcagtc ctctttcctg tcctcctccg ttgatcaatg ggtccagccg gacacggggt 5520 
cttcttctca cgtaacaagc acccactccc atcggcccgt aaatcttgtg cgacgaaatg 5580 
gacatcaggt caatgttgca cttgttcaca tcgat 5615 

<210> 108 
<211> 363 
<212> PRT 

<213> Ogataea minuta 
<400> 108 

Met Thr Thr Lys Asn Ile Val Leu Leu Pro Gly Asp His Val Gly Pro
1 5 10 15

Glu Val Val Asp Glu Ala Val Lys Val Leu Asn Ala Ile Ser Ala Ala
20 25 30

Lys Pro Glu Ile Lys Phe Asn Phe Glu His His Leu Ile Gly Gly Ala
35 40 45

Ala Ile Asp Ala Thr Gly Gln Pro Ile Thr Asp Ala Ala Leu Glu Ala
50 55 60

Ser Lys Lys Ala Asp Ala Val Leu Leu Gly Ser Val Gly Gly Pro Lys 
65 70 75 80 

Trp Gly Thr Gly Gln Val Arg Pro Glu Gln Gly Leu Leu Lys Ile Arg
85 90 95

Lys Glu Leu Asn Leu Tyr Ala Asn Leu Arg Pro Cys Ser Phe Ala Ser
100 105 110

Asp Ala Leu Leu Asp Leu Ser Pro Leu Lys Pro Glu Ile Val Arg Gly
115 120 125

Thr Asp Phe Val Val Val Arg Glu Leu Val Gly Gly Ile Tyr Phe Gly
130 135 140 

Glu Arg Lys Glu Asp Asp Gly Ser Gly Phe Ala Ser Asp Thr Glu Ala 
145 150 155 160 

Tyr Ser Val Pro Glu Val Gln Arg Ile Thr Arg Met Ala Ala Phe Met
165 170 175

Ala Leu Gln Ser Asp Pro Pro Leu Pro Val Tyr Ser Leu Asp Lys Ala
180 185 190

Asn Val Leu Ala Ser Ser Arg Leu Trp Arg Lys Thr Val Glu Glu Thr 
195 200 205 
Ile Lys Asn Glu Phe Pro Gln Leu Lys Leu Gln His His Leu Ile Asp
210 215 220 

Ser Ala Ala Met Ile Leu Val Lys Ser Pro Thr Lys Leu Asn Gly Val
225 230 235 240

Val Leu Thr Ser Asn Met Phe Gly Asp Ile Ile Ser Asp Glu Ala Ser
245 250 255

Val Ile Pro Gly Ser Leu Gly Leu Leu Pro Ser Ala Ser Leu Ala Ser
260 265 270 

Leu Pro Asp Ser Asn Glu Ala Phe Gly Leu Tyr Glu Pro Cys His Gly 
275 280 285 

Ser Ala Pro Asp Leu Ala Lys Gly Leu Val Asn Pro Leu Ala Thr Ile
290 295 300 

Leu Ser Ala Ala Met Met Leu Lys Leu Ser Leu Asn Leu Val Glu Glu 
305 310 315 320 

Gly Arg Ala Val Glu Lys Ala Val Arg Ala Val Leu Asp Gln Gly Ile
325 330 335 

Met Thr Ala Asp Leu Gly Gly Ser Ser Ser Thr Thr Glu Val Gly Asp 
340 345 350 

Ala Val Ala Lys Glu Val Thr Lys Leu Leu Gly
355 360

<210> 109 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer DL5 

<400> 109 

caggagctac agagtcatcg 20

<210> 110 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer DL3 

<400> 110 

acgagggaca ggttgctcgc 20

<210> 111 
<211> 8 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 111

Asp Thr Gly Ser Ser Asp Leu Trp
1 5



<210> 112 
<211> 8 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 112 

Phe Gly Ala Ile Asp His Ala Lys
1 5 

<210> 113 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer PLE5 for amplification of 
Ogataea minuta YPS1 gene 



<400> 113

gayacngght cntcngayyt ntgg 24



<210> 114 



<211> 23 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer PLE3 for amplification of 
Ogataea minuta YPS1 gene

<400> 114 

ttygghgcna tygaycaygc naa 23 

<210> 115 
<211> 3661 
<212> DNA 

<213> Ogataea minuta 

<220> 

<221> CDS 

<222> 1712..3523

<400> 115 

gaattcacca gttatctgga cgaggcttgt gtttcagacg agttgctgta cagtcaaatt 60

tgccaggatt attgctcatt gattgggctt tcttccaact ctcccctgta caacaccctt 120 

ctctctagtt tcattgcact gcccagtttt atcaaatacc acaggatatc caagctttct 180 

ggtaagctca actggacaac ggaaaacgag ctgccgtttg aaatcaatct gccactgttt 240 

ctgcaatttc attctgtgtt tatctgcccc atctccaaag aggagactac tcctacgaat 300 

ccgcctatag ttctgggttg ccatcacatc atatcgaagg agagcgctga caagctattg 360 

aaacagattt tccgggtgaa gtgtccatac tgtccaatga cttggtatga agatcgtctc 420 

aaagaggctc gctttgtgga tatatgattt gaaagattac agcgattgta agacggttat 480 

ttgatacaag ttggttgatt tttcaaggct gtgtaggaaa atttggagta aaaaaaattt 540 

tggatctcaa attaagttat caaaagctac gtaggggctg gccgagacga cagcactgaa 600 

tcaataaacc atcagtgatg agcgacgcac agataagaaa cggagcgcag aagagcaaga 660 
aggcgaaccg gaggtcgagg aagaagcgga gaaccgagga tttctcgtcg tcttctgaga 720
gctcagattc agacagagag gaggaagtga aggagagcgt tgaagctacc gaggaggttg 780 
aaacattaga gccagaggcc atggatctgg ctatcgatca gctgaatgtt acaggtgccg 840 
acgcggcaat gacgcaggat ttggacaaga ccagactgaa cttacgccgt cttgatgccc 900 
cgttggaggt gacgaggctt gggcagaccg ttgactccgg acgcgctgca aagttgggcg 960 
ggaacgaact gcagggcgcg cagtccaagg ttgagggggc ccgtaatgag ctgagaaatg 1020 
cttacttggg caagatgttg gggctctaca gtgacgactt ggatgctctc aggcagcaga 1080
gcgatttcac cgagaactcg ctgtccatgt tggcgcagct actgaaaaac agcggtaatg 1140 
tgtttgatga cgaggcgctg aagtcattag ttgaatagaa aacaggcaaa taattttggc 1200 
agggccgttt tgccgatgcg atataggctc tgttgccgat acgttcccgg ggagcttccc 1260 
tacggttgct gttctgtcgg tcttggcgag ttttccactt ttgcggccgc acgaagccca 1320 
gactagccag tcataccagc cgtggactcc gcctacttga cggggaaatt tttcccgtgc 1380 
cacttttccc ggggcaaaat aagtggctaa gcagcagaca agaaaaaaag gctcgaaaaa 1440 
gttaaaagaa gtaacagcag aatatatata gccaagtgtg gtttgtcaga agcaaagcac 1500
gctaatttga agcattttca cgggtgaaca gcacacaaag atctccaggg gggcgttctg 1560 
gttgtgaatt ttatatagag agcaaaaagg atttagaaat cgccgaaatt tgtttggttt 1620 
agaagtgctt ttattgtgag acgttttcgt gtatcagaag ggcatcttga cactcggtta 1680 
gaatatgagg tgcaaaaaca ttttggaagc aatgttgttg gtggccgggg gcacagtccc 1740 
cgtggcgggg ttgcctgctg gcgagtcgaa ggcaaactcg agtccggggt atctgcgaat 1800 
ggaggccgag atctacagag ggcattcgtt tgagacgtcc caacgcggag gacggccgta 1860 
tatgctggag aagcgagccg aggacggatc ggtgctaatg gagctgcaga acaaccaatc 1920 
attttacaaa gtggagcttg aagtgggttc agacaagcaa aagattggtg tcttagtgga 1980 
tacgggttcg tcagatctgt ggatcatgaa ccaaaacaac tcgtactgtg agtcctcgtc 2040 
ctcgtcctcc aagatgcggg aacgcaaggg cagaaagctg agtgatctca gaaacctgaa 2100 
cttagacgtg agcgaaaaaa acgtgaaggc tgtcggggct gcagagactg aaacgatgac 2160 
cttatcggtg ggagaaggtc tgttttcctg gttcgaaact cagacggacg gcagcggggg 2220 
agaaacagaa acggcttccg gagacagctc cgaggccacc attgattgct ctgtttacgg 2280
gacattcgac ccgtcctcct ccgatacgtt caaatcgaac ggaacggagt tttcgatttc 2340 
atacgccgat gacagtttcg ccaagggaac atggggcacc gacgatgtga ccttcaacgg 2400 
tgtcacggtg gatcaattgt ctatggcaat tgctgatgag accaactcgt cgatgggagt 2460
tcttggaatt ggactcaagg gcctggagac tacgtactcc ggagacgtga cgaatgcgta 2520 
cacgtacgaa aacttgccgt acaagatgca gtcccaggga ctgatcagca agccggtcta 2580 
ctcggtttat ttgaacgaca gcgagtccag cgctgcgtcg attttgtttg gagccgttga 2640 
ccacgacaag tacactggaa cgttgacgtt gctcccgatc atcaacacgg ccgaaagcct 2700 
gggctactcg acccccgtca gactcgaggt gacactgtca aagctttaca cgggctcgtc 2760 
ctcgaataaa acggccgtga gcatcgcgtc tggggctgcg gcagctctgt tggacacggg 2820 
aaccacgttg acgtacgttc cttcggacat catctctaca atcgtggacc agtacggctt 2880
tcaatacagc agttcggttg gaacgtatgt ggccaagtgc gactcgctcg acgatgctga 2940 
gattgtcttt gacttccagg gaaccaagat atgggttccg ttctcgtcgt ttgcggtctc 3000
actcaccacc aacggaggct cgcagtcgtc gtactgtgcg cttggcttga tggacagcgg 3060 
agacgacacc ttcactctgg gagactcgtt cctcaacaac gtctacttcg ttgccgatct 3120 
agagaacctg cagattgcca ttgctccggc taacctggac tccacgtcgg aggacattga 3180 
agtggtgagc gactcgggaa tcccgtctgc aaagtccgct tctgcctact cttccagttg 3240 
gggtgcgtct ggctccgcgg tggcctcgtc gttgtctgtt caaaccggcg cagaaaccgt 3300 
cacctccacc gatgctggct ccgactccac gggatctgcg tctgggtcgt ccggttcggc 3360 
ctcgtcctcc tcgtccaagt cttctgcgtc ctcctcgtct ggttcgtccg gctcgtcgtc 3420 
caagtcgggc tcgagctcgt ccaagtacgc tgccggaaac gcctggggaa tgagcgtctg 3480 
cagcctggct ttcaccatcg cggtctcggt gttggtgatt ggctaacctg gccgcagccg 3540 
ctttgcttcc atcctgctga ccccgccggt aactctggtc ggattgtatt acatacatac 3600 
atacctccca cgcgtttgat atcacgatgt gacttatttt tctgtgcaca gcccggaatt 3660 
c 3661 

<210> 116 

<211> 604 

<212> PRT 

<213> Ogataea minuta 



<400> 116 
Met Leu Leu Val Ala Gly Gly Thr Val Pro Val Ala Gly Leu Pro Ala
1 5 10 15

Gly Glu Ser Lys Ala Asn Ser Ser Pro Gly Tyr Leu Arg Met Glu Ala
20 25 30

Glu Ile Tyr Arg Gly His Ser Phe Glu Thr Ser Gln Arg Gly Gly Arg
35 40 45

Pro Tyr Met Leu Glu Lys Arg Ala Glu Asp Gly Ser Val Leu Met Glu
50 55 60

Leu Gln Asn Asn Gln Ser Phe Tyr Lys Val Glu Leu Glu Val Gly Ser
65 70 75 80

Asp Lys Gln Lys Ile Gly Val Leu Val Asp Thr Gly Ser Ser Asp Leu
85 90 95

Trp Ile Met Asn Gln Asn Asn Ser Tyr Cys Glu Ser Ser Ser Ser Ser
100 105 110

Ser Lys Met Arg Glu Arg Lys Gly Arg Lys Leu Ser Asp Leu Arg Asn
115 120 125

Leu Asn Leu Asp Val Ser Glu Lys Asn Val Lys Ala Val Gly Ala Ala
130 135 140

Glu Thr Glu Thr Met Thr Leu Ser Val Gly Glu Gly Leu Phe Ser Trp
145 150 155 160

Phe Glu Thr Gln Thr Asp Gly Ser Gly Gly Glu Thr Glu Thr Ala Ser
165 170 175

Gly Asp Ser Ser Glu Ala Thr Ile Asp Cys Ser Val Tyr Gly Thr Phe
180 185 190

Asp Pro Ser Ser Ser Asp Thr Phe Lys Ser Asn Gly Thr Glu Phe Ser
195 200 205

Ile Ser Tyr Ala Asp Asp Ser Phe Ala Lys Gly Thr Trp Gly Thr Asp
210 215 220

Asp Val Thr Phe Asn Gly Val Thr Val Asp Gln Leu Ser Met Ala Ile
225 230 235 240

Ala Asp Glu Thr Asn Ser Ser Met Gly Val Leu Gly Ile Gly Leu Lys
245 250 255

Gly Leu Glu Thr Thr Tyr Ser Gly Asp Val Thr Asn Ala Tyr Thr Tyr
260 265 270

Glu Asn Leu Pro Tyr Lys Met Gln Ser Gln Gly Leu Ile Ser Lys Pro
275 280 285

Val Tyr Ser Val Tyr Leu Asn Asp Ser Glu Ser Ser Ala Ala Ser Ile
290 295 300

Leu Phe Gly Ala Val Asp His Asp Lys Tyr Thr Gly Thr Leu Thr Leu
305 310 315 320

Leu Pro Ile Ile Asn Thr Ala Glu Ser Leu Gly Tyr Ser Thr Pro Val
325 330 335

Arg Leu Glu Val Thr Leu Ser Lys Leu Tyr Thr Gly Ser Ser Ser Asn
340 345 350

Lys Thr Ala Val Ser Ile Ala Ser Gly Ala Ala Ala Ala Leu Leu Asp
355 360 365

Thr Gly Thr Thr Leu Thr Tyr Val Pro Ser Asp Ile Ile Ser Thr Ile
370 375 380

Val Asp Gln Tyr Gly Phe Gln Tyr Ser Ser Ser Val Gly Thr Tyr Val
385 390 395 400

Ala Lys Cys Asp Ser Leu Asp Asp Ala Glu Ile Val Phe Asp Phe Gln
405 410 415

Gly Thr Lys Ile Trp Val Pro Phe Ser Ser Phe Ala Val Ser Leu Thr
420 425 430

Thr Asn Gly Gly Ser Gln Ser Ser Tyr Cys Ala Leu Gly Leu Met Asp
435 440 445

Ser Gly Asp Asp Thr Phe Thr Leu Gly Asp Ser Phe Leu Asn Asn Val
450 455 460

Tyr Phe Val Ala Asp Leu Glu Asn Leu Gln Ile Ala Ile Ala Pro Ala
465 470 475 480

Asn Leu Asp Ser Thr Ser Glu Asp Ile Glu Val Val Ser Asp Ser Gly
485 490 495

Ile Pro Ser Ala Lys Ser Ala Ser Ala Tyr Ser Ser Ser Trp Gly Ala
500 505 510

Ser Gly Ser Ala Val Ala Ser Ser Leu Ser Val Gln Thr Gly Ala Glu
515 520 525

Thr Val Thr Ser Thr Asp Ala Gly Ser Asp Ser Thr Gly Ser Ala Ser
530 535 540

Gly Ser Ser Gly Ser Ala Ser Ser Ser Ser Ser Lys Ser Ser Ala Ser
545 550 555 560

Ser Ser Ser Gly Ser Ser Gly Ser Ser Ser Lys Ser Gly Ser Ser Ser
565 570 575

Ser Lys Tyr Ala Ala Gly Asn Ala Trp Gly Met Ser Val Cys Ser Leu
580 585 590

Ala Phe Thr Ile Ala Val Ser Val Leu Val Ile Gly
595 600

<210> 117 
<211> 21 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer DY5
<400> 117 

ctcaagggcc tggagactac g 21 

<210> 118 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer DY3 
<400> 118 

cgggattccc gagtcgctca cc 22 

<210> 119 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer PDI5 for amplification of 
5' -region of Saccharomyces cerevisiae PDI gene 
<400> 119



tctagaatga agttttctgc tggtgccgtc ctg 33



<210> 120 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer PDI3 for amplification of 
3' -region of Saccharomyces cerevisiae PDI gene 



<400> 120 



ggatccttac aattcatcgt gaatggcatc ttc 33